Document made available under the 
Patent Cooperation Treaty (PCT) 



International application number: PCT/EP05/003715 
International filing date: 08 April 2005 (08.04.2005) 

Document type: Certified copy of priority document 

Document details: Country/Office: DE 

Number: 1 020040 18608.1 

Filing date: 16 April 2004 (16.04.2004) 



Date of receipt at the International Bureau: 23 May 2005 (23.05.2005) 



Remark: Priority document submitted or transmitted to the International Bureau in 
compliance with Rule 17.1(a) or (b) 




World Intellectual Property Organization (WIPO) - Geneva, Switzerland 
Organisation Mondiale de la Propriete Intellectuelle (OMPI) - Geneve, Suisse 



PCT/EP200 5/0037-1 9 

BUNDESREPUBLIK DEUTSCHLAND 




Prioritatsbescheinigung uber die Einreichung 
einer Patentanmeldung 



Aktenzeichen: 

Anmeldetag: 

Anmelder/Inhaber: 



10 2004 018 608.1 



16. April 2004 



Aventis Pharma Deutschland GmbH, 
65929 Frankfurt/DE 



Bezeichnung: 



Dual-transfected cells lines as in vitro screening tools 
for pharmaceutical compound profiling: A model for 
heptobiliary elimination 



IPC: 



C 12 N 5/06 



Die angehefteten Stucke sind eine richtige und genaue Wiedergabe der ur- 
sprunglichen Unterla^en dieser Patentanmeldung. 



Munchen, den 8. Marz 200} 
Deutsches Patent- untTMarkenc 
Der President 

Im Auftrac 




Aventis Pharma Deutschland GmbH 



DEAV 2004/0025 



Dr. RS/Ba 



Dual-transfected cells lines as in vitro screening tools for pharmaceutical compound 
5 profiling: A model for heptobiliary elimination 

A cell line transfected with both human NTCP and human MRP2 suitable as an in 
vitro tool for pharmaceutical compound profiling: A model for hepatobiliary 
elimination 

10 

The invention is referring to several dual-transfected cell lines expressing human 

*NTCP (Na/taurocholate Cotransporting Protein; SLC10A1) together with human 
BSEP (Bile Salt Export Pump; ABCB1 1) or human MRP2 (Multidrug Resistance 
Protein; ABCC2) suitable as an in vitro tool for pharmaceutical compound profiling 
15 particularly as a model for hepatobiliary elimination. 

Transcellular transport processes across biological barriers are playing an essential 
role in the absorption, distribution and elimination of drugs and xenobiotics. In 
intestinal epithelial cells transport proteins are involved in the absorption or exclusion 
20 of drugs into/from the systemic circulation. Transport proteins expressed in the cells 
of the blood-brain-barrier are involved in the excision of toxic (and also therapeutic) 
compounds from the brain. In kidney proximal tubule epithelia cells and in 

• hepatocytes transport proteins are involved in the elimination of various endogenous 
and exogenous substances. In the hepatocyte this elimination process is mediated 
25 by (i) uptake from blood into the cell via the basolateral membrane and (ii) 
elimination into the bile via the apical membrane. Paracellular diffusion in all 
biological barriers is anticipated by the. formation of tight junctions (molecular 
structures providing a very tight contact between adjacent cells) separating the 
apical from the basolateral membrane compartment. Cell lines that have maintained 
30 the features of a biological barrier (development of basolateral and apical membrane 
compartments separated by tight junctions) are also called polarized cell lines. 
Usually these cell lines grow in a single cell layer designated as monolayer. 
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The basolateral (also called sinusoidal) uptake of compounds into the human 
hepatocyte is mediated by a variety of different transport proteins: Bulky organic 
anions are taken up by OATP8 (SLC21A8), OATP2 (SLC21 A6, also known as 
OATP-C or LST1) and OATP-B (SLC21A9) whereas bile acids and possibly other 

5 compounds similar to bile acids are accepted by the Sodium Taurocholate Co- 
transporting Polypeptide NTCP (SLC10A1). OAT2 (SLC22A7) imports small organic 
anions into the hepatocyte and OCT1 (SLC22A1) has been reported to facilitate the 
cellular uptake of small organic cations. At the apical (also called canalicular) 
membrane, the elimination of organic anions and bile salts into bile is mediated by 

10 the Multidrug-resistance Related Protein MRP2 (ABCC2) and the Bile Salt Export 
Pump BSEP (ABCB1 1 ), both members of the ABC transporter family (ATP- 

•dependent export pumps). Other ABC Transporters also known to be expressed at 
the canalicular membrane of the human hepatocyte are the Breast Cancer 
Resistance Protein BCRP (ABCG2, also known as MXR) and the Multidrug- 
15 resistance protein MDR1 (ABCB1 , also known as P-glycoprotein). These two 

proteins are thought to participate in the canalicular secretion process of amphiphilic 
and cationic compounds. 

The multidrug resistance protein (MDR) family mediates the ATP-dependent 
20 unidirectional transport of conjugates of lipophilic substances with glutathione, 

glucuronate, or sulfate (Review: Konig J. et al., Biochimica et Biophysica Acta 1461 
(1999)377-394). 

• The bile salt pool undergoes an enterohepatic circulation that is regulated by distinct 
bile salt transport proteins, including the canalicular bile salt export pump BSEP, the 

25 ileal NA + -depentent bile salt transporter ISBT, and the hepatic sinusoidal Na + - 
taurocholate cotransporting polypeptide NTCP. Other bile salt transporters include 
the organic anion transporting polypeptides OATPs and the multidrug resistance- 
associatd proteins 2 and 3 MRP2,2 (Review: Trauner et al., Physiol. Rev. 83: 633- 
671,2003). 

30 

To date the only way to investigate hepatobiliary elimination of drugs is the bile 
fistula study done in rats, a complex and time-consuming in vivo assay. In addition 
the bile fistula study doesn't give any information about the molecular basis of the 




elimination process. Here we describe a cell-based assay system in which a liver 
uptake transporter (in this case human NTCP) is constitutively expressed together 
with a liver export pump (usually an ABC transporter, in this case human MRP2 or 
human BSEP) in the polarized canine kidney cell line MDCKII (for reference see 
Louvard, 1980; Fuller et ah, 1984). The resulting cell lines therefore express two 
transgenes in one cell line (i) human NTCP together with human BSEP (MDCKII- 
hNTCP/hBSEP) and (ii) human NTCP together with human MRP2 (MDCKll-hNTCP- 
hMRP2). The cells are cultivated in 6-well filter inserts on a porous filter membrane 
thereby separating a basblateral from an apical compartment (Fig.1). Putative 
) substrates (e.g. pharmaceutical compounds) are added either to the apical or to the 
basolateral compartment for transcellular (vectorial) transport measurements in both 
directions. Samples are taken from each compartment after the indicated incubation 
periods. Substrates of a given transporter combination (in this case human NTCP 
and human BSEP or human NTCP and human MRP2) display a significant net 
15 transport from the basolateral to the apical compartment when compared to un- 
transfected or mono-transfected cells (Fig. 4 + 5). 

With the aid of the described dual-transfected cell lines one will be able to predict 
whether the transporter combinations human NTCP / human BSEP or human NTCP 
20 / human MRP2 participates in the hepatobiliary elimination process of a 
pharmaceutical compound or not. 

During pharmaceutical drug development compounds frequently fail to proceed into 
_ clinical development due to their weak pharmacokinetic profile. One reason for this is 
^7 their rapid elimination via the liver into the bile. Examples for this phenomenon are 
the chlorogenic acid derivatives, potent and specific inhibitors of the glucose-6- 
phosphate translocase (a potential novel pharmaceutical target in patients with type 
II diabetes). Pharmacodynamic studies showed that the blood glucose lowering 
effect of several chlorogenic acid derivatives lasts only a short time after bolus i.v. 
30 injection in Wistar rats. Bile fistula studies indicated that this weak pharmacodynamic 
effect is the result of unsuitable pharmacokinetics due to a rapid hepatobiliary 
elimination process. With the aid of dual-transfected transporter cell lines one will be 
able to identify the transporters involved in the elimination process. In a next step the 
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compounds can than be chemically modified in a way that alleviates their affinity to 
the eliminating transporter to improve the compounds pharmacology. 

Another application area would be the examination of potential drug-drug 
5 interactions at the level of transporters in the liver. For example competition of two 
drugs for one transporter in the liver may alter the pharmacokinetic profile of both 
compounds significantly. In addition, inhibition or activation of a transporter by one 
drug may also change the pharmacological behaviour of the transporters substrates. 

10 The invention pertains to a mammalian cell having a first and a second side which 
both sides form part of the outer surface of such cell and which both sides are 
different from the areas of contact of such cell and which first and second side are 
distinguished from each other by their localization at opposite ends of such cell 
wherein the first side carries a functional hNTCP protein and the second side carries 
15 a functional hBSEP protein. 

The first side of such a mammalian cell is e.g. the basolateral side and the second 
side is the apical side or the first side is e.g. the apical side and the second side is 
e.g. the basolateral side. 

The mammalian cell can be chosen from epithelial cells in particular of the kidney, of 
the bowels systems, of the liver or of the blood/brain barrier. 

Such cells can be immortalized cells or recombinant cells. The cells can be taken 
from ordinary mammalian tissue or a primary cell culture. Such cells can be also the 
cells of a cell culture as e.g. LLC-PK1 cells or MDCKII cells in particular also when 
carrying recombinant vectors. 

The LLC-PK1 cells may harbor for example a recombinant vector e.g. suitable for 
expressing the hNTCP protein (as exemplified by SEQ ID NO. 4 ) and a 
recombinant vector suitable for expressing the hBSEP protein (as exemplified by 
SEQ ID NO. 5 ). As alternative the hNTCP and the hBSEP proteins may be 
expressed simultaneously from the same vector construct. The MDCKII cell may 
harbor for example a recombinant vector e.g. suitable for expressing the hNTCP 
protein (as exemplified by SEQ ID NO. 4 ) and a recombinant vector suitable for 
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expressing the hBSEP protein (as exemplified by SEQ ID NO. 5 ). Such a 
mammalian cell of a MDCKII type carrying two separate vectors with ability of 
expressing hNTCP protein and hBSEP protein has been deposited with the DSMZ- 
Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH, Mascheroder 
Weg 1b, D-38124 Braunschweig. The identification number is DSM ACC2643. 

The invention pertains further to the manufacturing of a mammalian cell carrying a 
functional human NTCP protein and a functional human BSEP protein wherein 



a mammalian cell is provided; 

a vector is provided encompassing the coding sequence of hNTCP (e.g. 

vector as drafted according or equal to SEQ ID NO. 4 ); 

a vector is provided encompassing the coding sequence of hBSEP (e.g. 

vector as drafted according or equal to SEQ ID NO. 5); 

the mammalian cell from a] is transformed by a vector from b] and by a vector 

from c] either simultaneously or consecutively; 

a double transfectant cell from d] is identified and propagated. 
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The mammalian cells that shall be provided for such a manufacturing may be 
epithelial cells in particular of the kidney, of the lowels system, of the liver or of the 
blood/brain barrier. Such cells can be also immortalized cells or recombinant cells. 
The cells can be taken from ordinary mammalian tissue or a primary cell culture. 
Such double transfectant cells according to e] resulting from the manufacturing as 
mentioned above may be e.g. LLC-PK1 cells or MDCKII cells in particular when 
carrying recombinant vectors. Such LLC-PK1 cells may harbor for example a 
recombinant vector e.g. suitable for expressing the hNTCP protein (as exemplified 
by SEQ ID NO. 4 ) and a recombinant vector suitable for expressing the hBSEP 
protein (as exemplified by SEQ ID NO. 5 ). The hNTCP and the hBSEP proteins 
may be expressed also simultaneously from the same vector construct. Such 
MDCKII cells may harbor for example a recombinant vector e.g. suitable for 
expressing the hNTCP protein (as exemplified by SEQ ID NO. 4 ) and a 
recombinant vector suitable for expressing the hBSEP (as exemplified by SEQ ID 
NO 5 ) Such a mammalian cell of a MDCKII type carrying two separate vectors 
with ability of expressing hNTCP protein and hBSEP protein has been deposited w.th 
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the DSMZ-Deutsche Sammlung von Mikroorganismen and Zellkulturen GmbH, 
Mascheroder Weg 1 b, D-38124 Braunschweig. The identification number is DSM 
ACC2643. 



The invention pertains further to a monolayer of cells comprising at least two cells of 
mammalian cells having a first and a second side which both sides form part of the 
outer surface of such cell and which both sides are different from the areas of 
contact of such cell and which first and second side are distinguished from each 
other by their localization at opposite ends of such cell wherein the first side carries a 
functional hNTCP protein and the second side carries a functional hBSEP protein. 
Such a monolayer could occupy a part or the whole of a solid surface. The invention 
pertains also to such a solid surface carrying a monolayer of such cells. 



The solid surface could be formed by a plastic. The solid surface could also be part 
15 of a petri dish or a filter-insert. 



The invention pertains further to a petri dish carrying a monolayer of cells as 
described before and as well as to a filter-insert carrying a monolayer of cells as 
described before. The membrane support of such a filter-insert could be made of 
polycarbonat and/or polyester. The pore size of the membrane support could be of 

0. 4 urn +/- 0.2 urn. An according example of a filter-insert has been depicted in Fig. 

1. There are several manufacturers of filter-inserts that could be used for placing the 
said monolayer of cells on it. Such manufacturers are for example "Corning Inc., 
Corning, NY" or "Millipore". 
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The mammalian cell having a first and a second side which both sides form part of 
the outer surface of such cell and which both sides are different from the areas of 
contact of such cell and which first and second side are distinguished from each 
other by their localization at opposite ends of such cell wherein the first side carries a 
functional hNTCP protein and the second side carries a functional hBSEP protein 
could be used e.g. for determining pharmacological profiles with respect to 
hepatobiliary elimination and/or renal excretion and/or brain resorption and/or 
intestinal resorption. For such use the mammalian cells could form part of a 
monolayer on a solid surface and/or a petri disk and/or on a filter insert. 





The invention pertains further to a mammalian cell having a first and a second side 
which both sides form part of the outer surface of such cell and which both sides are 
different from the areas of contact of such cell and which first and second side are 
distinguished from each other by their localization at opposite ends of such cell 
wherein the first side carries a functional hNTCP protein and the second side carries 
a functional hMRP2 protein. 

The first side of such a mammalian cell is e.g. the basolateral side and the second 
side is the apical side or the first side is e.g. the apical side and the second side is 
e.g. the basolateral side. 

The mammalian cell can be chosen from epithelial cells in particular of the kidney, of 
the bowels system, of the liver or of the blood/brain barrier. 

Such cells can be immortalized cells or recombinant cells. The cells can be taken 
from ordinary mammalian tissue or a primary cell culture. Such cells can be also the 
cells of a cell culture as e.g. LLC-PK1 cells or MDCKII cells particularly when 
carrying recombinant vectors. 

The LLC-PK1 cells may harbor for example a recombinant vector e.g. suitable for 
expressing the hNTCP protein (as exemplified by SEQ ID NO. 4 ) and a 
recombinant vector suitable for expressing the hMRP2 protein (as exemplified by 
SEQ ID NO. 6). As alternative the hNTCP and the hMRP2 proteins may be 
expressed simultaneously from the same vector construct. The MDCKII cell shall 
harbor a recombinant vector e.g. suitable for expressing the hNTCP protein (as 
exemplified by SEQ ID NO. 4) and a recombinant vector suitable for expressing the 
hMRP2 protein (as exemplified by SEQ ID NO. 6 ). Such a mammalian cell of a 
MDCKII type carrying two separate vectors with ability of expressing hNTCP protein 
and hMRP2 protein has been deposited with the DSMZ-Deutsche Sammlung von 
Mikroorganismen und Zellkulturen GmbH, Mascheroder Weg 1 b, D-38124 
Braunschweig. The identification number is DSM ACC2644. 



The invention pertains further to the manufacturing of a mammalian cell carrying a 
functional human NTCP protein and a functional human MRP2 protein wherein 



8 



a] a mammalian cell is provided; 

b] a vector is provided encompassing the coding sequence of hNTCP (e.g. a 
vector as drafted according or equal to SEQ ID NO. 4 ); 

c] a vector is provided encompassing the coding sequence of hMRP2 (e.g. 
vector as drafted according or equal to SEQ ID NO. 6 ); 



d] 




the mammalian cell from a] is transformed by a vector from b] and by a vector 
from c] either simultaneously or consecutively; 

a double transfectant cell from d] is identified and propagated. 



The mammalian cells that shall be provided for such a manufacturing may be. 
i epithelial cells in particular of the kidney, of the bowels system, of the liver or of the 
blood/brain barrier. Such cells can be also immortalized cells or recombinant cells. 
The cells can be taken from ordinary mammalian tissue or a primary cell culture. 

Such double transfectant cells according to e] resulting from the manufacturing as 
0 mentioned above may be the cells of a cell culture as e.g. LLC-PK1 cells or MDCKII 
cells particularly when carrying recombinant vectors. Such LLC-PK1 cells may 
harbor for example a recombinant vector e.g. suitable for expressing the hNTCP 
protein (as exemplified by SEQ ID NO. 4 ) and a recombinant vector suitable for 
expressing the hMRP2 protein (as exemplified by SEQ ID NO. 6). The hNTCP and 
the hMRP2 proteins may be expressed also simultaneously from the same vector 
construct. Such MDCKII may harbor for example a recombinant vector e.g. suitable 
for expressing the hNTCP protein (as exemplified by SEQ ID NO. 4 ) and a 
recombinant vector suitable for expressing the hMRP2 (as exemplified by SEQ ID 
NO. 6). Such a mammalian cell of a MDCKII type carrying two separate vectors with 
30 ability of expressing hNTCP protein and hMRP2 protein has been deposited with the 
DSMZ- Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH, 
Mascheroder Weg 1 b, D-38124 Braunschweig. The identification number is DSM 
ACC2644. 
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The invention pertains further to a monolayer of cells comprising at least two cells of 
mammalian cells having a first and a second side which both sides form part of the 
outer surface of such cell and which both sides are different from the areas of 
contact of such cell and which first and second side are distinguished from each 
5 other by their localization at opposite ends of such cell wherein the first side carries a 
functional hNTCP protein and the second side carries a functional HMRP2 prote.n. 

Such a monolayer could occupy a part or the whole of a solid surface. The invention 
pertains to such a solid surface carrying a monolayer of such cells. 
10 The solid surface could be formed by a plastic. The solid surface could also be part 
of a petri dish or a filter-insert. 

The invention pertains also to a petri dish carrying a monolayer of cells as described 
and further to a filter-insert carrying a monolayer of cells as described. The 
15 membrane support of such a filter-insert could be made of polycarbonat and/or 

polyester. The pore size of the membrane support could be of about 0.4 urn +/- 0.2 
urn. An according example of a filter-insert has been depicted in Fig. 1 . There are 
several manufacturers of filter-inserts that could be used for placing the said 
monolayer of cells on it. Such manufacturers are for example "Corning Inc., Corning, 
20 NY" or "Millipore". 

The mammalian cell having a first and a second side which both sides form part of 
the outer surface of such cell and which both sides are different from the areas of 
contact of such cell and which first and second side are distinguished from each 
other by their localization at opposite ends of such cell wherein the first side carries a 
functional hNTCP protein and the second side carries a functional hMRP2 prote.n 
could be used e.g. for determining pharmacological profiles with respect to 
hepatobiliary elimination and/or renal excretion and/or brain resorption and/or 
intestinal resorption. For such use the mammalian cells could form part of a 
30 monolayer on a solid surface and/or a petri dish and/or on a filter-insert. 
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A protein shall be regarded as functional in context of this invention when it is in a 
condition to perform an activity in a biological context in particular as part of a living 
cell. Such an activity is detectable e.g. by an assy. A transporter is functional for 
example when this transporter moves a compound in particular the transporter's 
biological substrate from outside a cell into the inner compartment of this cell or vice 
versa. A biological substrate of an ion transporter protein consists e.g. of a 
monovalent and/or a divalent ion or other ions. The substrate of a glucose 
transporter protein is e.g. glucose. The substrate of a multiple drug resistance 
protein is e.g. a drug alone or conjugated to gluthathion or gluconate. 

The handling of proteins in context of this invention can be achieved by a person 
skilled in the art by applying the according protocols from "Current Protocols in 
Protein Science" published by John Wiley & Sons (edited by: John E. Coligan, Ben 
M. Dunn, Hidde L. Ploegh, David W. Speicher, Paul T. Wingfield; 0-471-11184-8- 
Looseleaf ; 0-471 -1 4098-8-CDROM). 

The handling of techniques concerning Molecular Biology as e.g. cloning, 
transforming of cells, sequencing, modifying promoters, expressing proteins or 
others can be achieved a the person skilled in the art by applying the according 
protocols from "Current Protocols in Molecular Bioiogy" published by John Wiley & 
Sons (edited by: Fred M. Ausubel, Roger Brent, Robert E. Kingston, David D. Moore, 
J.G. Seidman, John A. Smith, Kevin Struhl; 0-471-50338-X-Looseleaf; 0-471- 
306614CDROM). 

The handling of biological cells can be achieved a the person skilled in the art by 
applying the according protocols from "Current Protocols in Cell Biology" published 
by John Wiley & Sons (edited by: Juan S. Bonifacino, Mary Dasso, Jennifer 
Lippincott-Schwartz, Joe B. Harford, Kenneth M. Yamada; 0-471 -24 1 08-3-Looseleaf; 
0-471-241 05-9-CDROM). 
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Examples 

Cell line generation: MDCKII-hBSEP and MDCKII-hNTCP/hBSEP: 

5 MDCK II cells (Madin-Darby canine kidney cells, strain II) were cultured in 

Dulbecco's modified essential medium supplemented with 10 % fetal calf serum at 
37°C, 5% C0 2 , and 95% humidity. The MDCKII-hNTCP/hBSEP cell line (DSM 
ACC2643) was generated as follows: MDCK II cells were first transfected with the 
mammalian expression plasmid pcDNA3.1zeo(+)-hBSEP using the FuGene6 

10 transfection reagent as described by the manufacturer (Roche). Transfected cells 
were selected with 1000ug/ml Zeocin. During selection the tissue culture medium 

•was additionally supplemented with 100units/ml penicillin G and 100ug/ml 
streptomycin sulfate. The resulting cell clones were screened in the immunoblot for 
recombinant human BSEP expression followed by analysis of the proper subcellular 
15 localization of recombinant human BSEP using Confocal Laser Scanning 
Microscopy. The cell clone with the highest expression of human BSEP and 
concomitant localization of the recombinant BSEP at the apical membrane was 
chosen for FuGene6 transfection with the mammalian expression plasmid 
pcDNA3.1neo(+)-hNTCP. Transfected cells were selected with 800ug/ml Geneticin 
20 and 1 0OOug/ml Zeocin. During selection the tissue culture medium was additionally 
supplemented with 100units/ml penicillin G and 100ug/ml streptomycin sulfate. 
Double-transfected cell lines were again screened in the immunoblot for the 

• expression of both, recombinant human BSEP and recombinant human NTCP, 
followed by analysis of the proper subcellular localization of both transgenes by 
25 Confocal Laser Scanning Microscopy. The cell clone with the highest expression and 
proper subcellular localization of both, human BSEP (apical membrane) and human 
NTCP (basolateral membrane) was chosen for further examinations. 

Cell line generation: MDCKII-hMRP2 and MDCKII-hNTCP/hMRP2: 

30 

MDCK II cells (Madin-Darby canine kidney cells, strain II) were cultured in 
Dulbecco's modified essential medium supplemented with 10 % fetal calf serum at 
37°C, 5% CO z , and 95% humidity. The MDCKII-hNTCP/hMRP2 cell line (DSM 
ACC2644) was generated as follows: MDCK II cells were first transfected with the 
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mammalian expression plasmid pcDNA3.1zeo(+)-hMRP2 using the FuGene6 
transfection reagent as described by the manufacturer (Roche). Transfected cells 
were selected with 1000ug/ml Zeocin. During selection the tissue culture medium 
was additionally supplemented with 100units/ml penicillin G and 100ug/ml 

5 streptomycin sulfate. The resulting cell clones were screened in the immunoblot for 
recombinant human MRP2 expression followed by analysis of the proper subcellular 
localization of recombinant human MRP2 using Confocal Laser Scanning 
Microscopy. The cell clone with the highest expression of human MRP2 and 
concomitant localization of the recombinant MRP2 at the apical membrane was 

10 chosen for FuGene6 transfection with the mammalian expression plasmid 

pcDNA3.1neo(+)-hNTCP. Transfected cells were selected with 800ug/ml Geneticin 

•and 1000ug/ml Zeocin. During selection the tissue culture medium was additionally 
supplemented with 100units/ml penicillin G and 100ug/ml streptomycin sulfate. 
Double-transfected cell lines were again screened in the immunoblot for the 
15 expression of both, recombinant human MRP2 and recombinant human NTCP, 
followed by analysis of the proper subcellular localization of both transgenes by 
Confocal Laser Scanning Microscopy. The cell clone with the highest expression and 
proper subcellular localization of both, human MRP2 (apical membrane) and human 
NTCP (basolateral membrane) was chosen for further examinations. 

20 

Cell line generation: MDCKII-hNTCP: 

• MDCK II cells (Madin-Darby canine kidney cells, strain II) were cultured in 
Dulbecco's modified essential medium supplemented with 10 % fetal calf serum at 
25 37°C, 5% C0 2 , and 95% humidity. For generation of the MDCKII-hNTCP cell line 
MDCKII cells were transfected With the mammalian expression plasmid 
pcDNA3.1neo(+)-hNTCP using the FuGene6 transfection reagent as described by 
the manufacturer (Roche). Transfected cells were selected with 800ug/ml Geneticin. 
During selection the tissue culture medium was additionally supplemented with 
30 100units/ml penicillin G and 100ug/ml streptomycin sulfate. The resulting cell clones 
were screened in the immunoblot for recombinant human NTCP expression followed 
by functional analysis of the cell clones with the highest human NTCP expression 
(measurement of taurocholic acid uptake in MDCKII-hNTCP cells compared with 
parental MDCKII cells). 



13 



10 



Chemicals: PH(G)]-Bromosulfophthalein (BSP) (0.2 TBq/mmol) was purchased from 
Hartmann Analytic (Braunschweig, Germany). [2,4- 3 H(N)]-Cholic acid (0.7 
TBq/mmol), [Carboxyl-14C]-Chenodeoxycholic acid (1.9 GBq/mmol) and [Glycine-2- 
3H]-Glycocholic acid (0.15 TBq/mmol) were obtained from Biotrend Chemikalien 
(Koln, Germany). PH(G)]-Taurocholic acid (0.1 TBq/mmol) was purchased from 
PerkinElmer Life Sciences. Zeocin, geneticin, penicillin G, and streptomycin sulfate 
were obtained from Invitrogen. Additional chemicals of analytical purity were 
obtained from Sigma and Merck (Darmstadt, Germany) 




Transport assays: 

Cells were grown at confluency (1x10 6 cells / insert) on polyester membrane inserts 
'(pore size 0.4uM, insert diameter 24mm, Costar) in 6-well plates for 6 days. 
Overexpression of transgenes was induced by addition of 10mM sodium butyrate to 
1 5 the growth medium for 24 h. The next day cells were washed once with transport 
assay buffer (142mM NaCI, 5mM KCI, 5mM Glucose, 1.5mM CaCI 2 , 1.2mM MgS0 4 , 
1mM KH 2 P0 4 , 12.5 mM HEPES, pH 7.3) followed by addition of PH]- or [ 14 C]-labeled 
compounds either to the apical (1 .5 ml) or to the basolateral (2.5 ml) compartments. 
The opposite compartments were filled with transport assay buffer. After the 
20 indicated incubation periods (37°C) a sample was taken from every compartment 
and measured by scintillation counting (Wallac, Winspectral 1414 Liquid Scintillation 
Counter). The transcellular transport in percent of input was calculated as follows: 

Transport from the apical to the basolateral compartment: 
Transport A -B t%] = 100 x (Radioactivity ( t <|) basolateral [dpm] x 2,5 [ml])/ 

(Radioactivity (to) apical [dpm] x 1 ,5 [ml]) 




30 



Transport from the basolateral to the apical compartment: 
Transporte-A [%] = 100 x (Radioactivity (ti) apical [dpm] x 1,5 [ml])/ 

(Radioactivity (to) basolateral [dpm] x 2,5 [ml]) 
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The following deposits of biological material were made in context of the present 
invention with DSMZ - Deutsche Sammlung von Mikroorganismen und Zellkulturen 
GmbH, MascheroderWeg 1b; D-38124 Braunschweig: 

5 DSM ACC2643: Double transfected MDCKII cells harboring a vector for expressing 
human NTCP (Seq ID No. 4) and a vector for expressing human BSEP (Seq ID No. 

5) . 

DSM ACC2644: Double transfected MDCKII cells harboring a vector for expressing 
10 human NTCP (Seq ID No. 4) and a vector for expressing human MRP2 (Seq ID No. 

6) . 




15 Description of the Figures: 



Fig 1 : 

General assay design 

The cells are grown as a polarized and tight monolayers in filter inserts on a porous 
20 membrane (polyester membrane, pore size 0.4uM, 6-well) thereby separating a 
basolateral from an apical compartment. Fig. 1 is depicting an example of an filter- 
insert carrying a monolayer of cells. 




Fig 2: 

Assay design MDCK H-hNTCP/hBSEP 
Schematic representation of MDCK II cells (A) and MDCK II cells transfected with 
human BSEP (B), or human NTCP (C), or both human NTCP and human BSEP (D). 
Human NTCP is localized at the basolateral membrane whereas human BSEP is 
localized at the apical membrane. The mono-transfected cell lines MDCKII-hBSEP 
30 and MDCKII-hNTCP serve as controls. 



V, 

15 

Fig 3: 

Assay design MDCK H-hNTCP/hMRP2 

Schematic representation of MDCK II cells (A) and MDCK II cells transfected with 
human MRP2 (B), or human NTCP .(C), or both human NTCP and human MRP2 (D). 
5 Human NTCP is localized at the basolateral membrane whereas human MRP2 is 
localized at the apical membrane. The mono-transfected cell lines MDCKII-hMRP2 
and MDCKII-hNTCP serve as controls. 

Fig 4: 

10 Vectorial transport of [ 3 H]Glycocholic acid, [ 3 H]Taurocholic acid, [ 3 H]Cholic acid, 
[ 14 C]Chenodeoxycholic acid and [ 3 H]BSP (Bromosulfophthalein) in MDCKII- 

•hNTCP/hBSEP monolayers 
MDCKII, MDCKII-hNTCP, MDCKII-hMRP2 and MDCKII-hNTCP/hMRP2 cells were 
grown on filter inserts for 6 days (1x10 s cells/well). Transgene expression was 
15 induced by addition of 1 0mM sodium butyrate for 24 h. The indicated amount of each 
compound was given separately either to the basolateral or to the apical 
compartment. After 45 and 90 minutes an aliquot was taken from the opposite 
compartment and analysed by liquid scintillization counting. A-B indicates addition of 
compound to the apical compartment and sampling from the basolateral 
20 compartment (vectorial transport from the apical to the basolateral compartment). B- 
A indicates addition of compound to the basolateral compartment and sampling from 
the apical compartment (vectorial transport from the basolateral to the apical 
(■ i^^j^ compartment). Data represent means ± SD (n=3). 

j 25 Fig 5: 

Vectorial transport of [ 3 H]BSP (Bromosulfophthalein), PH]Cholic acid and 
[ 3 H]Taurocholic acid in MDCKII-hNTCP/hMRP2 monolayers 
MDCKII, MDCKII-hMRP2 and MDCKII-hNTCP/hMRP2 cells were grown on filter 
inserts for 6 days (1x10 6 cells/well). Transgene expression was induced by addition 
30 of 10mM sodium butyrate for 24 h. 100nM of each compound was given separately 
either to the basolateral or to the apical compartment. After one and two hours an 
aliquot was taken from the opposite compartment and analysed by liquid 
scintillization counting. A-B indicates addition of compound to the apical 
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compartment and sampling from the basolateral compartment (vectorial transport 
from the apical to the basolateral compartment). B-A indicates addition of compound 
to the basolateral compartment and sampling from the apical compartment (vectorial 
transport from the basolateral to the apical compartment). Data represent means ± 
SD (n=3). 

Fig 6: 

Human NTCP (Slc10A1), cDNA sequence; Length: 1061 bases; the depicted 
sequence is corresponding to Seq ID No. 1 . 

Fig. 7: 

.Human BSEP (ABCB1 1), cDNA sequence; Length: 3966 bases; the depicted 
'sequence is corresponding to Seq ID No. 2. 

Fig. 8: 

Human MRP2 (ABCC2), cDNA sequence; Length: 4650 bases; the depicted 
sequence is corresponding to Seq ID No. 3. 

i 

Fig. 9: 

pcDNA3.1neo(+)-hNTCP, complete sequence; Length: 6433 bases; the depicted 
sequence is corresponding to Seq ID No. 4: 

The human Sodium-dependent Tauchocholate Cotransporting Protein (hNTCP, 
i SLC10A1) cDNA was cloned into the MCS of pcDNA3.1neo(+) (Invitrogen) via 
' BamH I and Xba I. Human NTCP cDNA was PCR-amplified with gene-specific 
primers from a human liver cDNA library (Clontech). The cloned cDNA corresponds 
to GenBank Accession # L21893 from position 76-1136 (coding sequence: position 
83-1 132). Exceptions: position 616 c ^ t (no amino acid change) and position 774 1 
-> c (amino acid change F -> S). 

Fig. 10: 

pcDNA3.1zeo(+)-hBSEP, complete sequence; Length: 9043 bases; the depicted 
sequence is corresponding to Seq ID No. 5: 
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The human Bile Salt Export Pump (hBSEP, ABCB1 1) cDNA was cloned by using the 
Gateway Technology (Invitrogen). Human BSEP cDNA was PCR-amplified with attB- 
modified gene-specific primers from a human fetal liver cDNA library (Clontech) and 
cloned via pDONR221 into the pcDNA3.1zeo(+) vector (Invitrogen, modified to a 
5 Gateway destination vector). The cloned cDNA corresponds to GenBank Accession 
# AF136523from position 128-4093 (coding sequence: position 128-4093). 
Exception: position 321 1 G -> A (no amino acid change). 

Fig. 11: 

10 pcDNA3.1zeo(+)-hMRP2, complete sequence; Length: 9658 bases; the depicted 
sequence is corresponding to Seq ID No. 6: 

• The human Multidrug-resistance Related Protein 2 (MRP2, ABCC2) cDNA was 
cloned into the MCS of pcDNA3.1zeo(+) (Invitrogen) via Nhe I and Afl II. Human 
MRP2 cDNA was PCR-amplified with gene-specific primers from a HepG2 cDNA 
15 library. The cloned cDNA corresponds to GenBank Accession # X96395 from 

position 26-4675 (coding sequence: position 38-4675). Exception: position 4009 C -> 
T (no amino acid change). 

20 

Description of Seq IDs: 
Seq ID No. 1: 

Human NTCP (Slc10A1), cDNA sequence; Length: 1061 bases; Seq ID No. 1 is 
25 disclosed in Fig. 6. 

Seq ID No. 2: 

Human BSEP (ABCB1 1), cDNA sequence; Length: 3966 bases; Seq ID No. 2 is 
disclosed in Fig. 7. 

30 

Seq ID No. 3: 

Human MRP2 (ABCC2), cDNA sequence; Length: 4650 bases; Seq ID No. 3 is 
disclosed in Fig. 8. 
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Seq ID No. 4: 

pcDNA3.1neo( + )-hNTCP, complete sequence; Length: 6433 bases; Seq ID No. 4 is 
disclosed in Fig. 9. 

5 Seq ID No. 5: 

P cDNA3.1zeo(+)-hBSEP, complete sequence; Length: 9043 bases; Seq ID No. 5 is 
disclosed in Fig. 10. 

Seq ID No. 6: 

10 pcDNA3.1zeo( + )-hMRP2, complete sequence; Length: 9658 bases; Seq ID No. 6 is 
disclosed in Fig. 1 1 . 




15 References: 
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infects and matures only through the basolateral surface of the polarized 
epithelial cell line, MDCK. Cell 38: 65-77 
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Claims 
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Mammalian cell having a first and a second side which both sides form part of 
the outer surface of such cell and which both sides are different from the 
areas of contact of such cell and which first and second side are distinguished 
from each other by their localization at opposite ends of such cell wherein the 
first side carries a functional hNTCP protein and the second side carries a 
functional hBSEP protein. 

Mammalian cell as claimed in claim 1 wherein the first side is the basolateral 
side and the second side is the apical side. 

Mammalian cell as claimed in claim 1 and 2 wherein the first side is the apical 
side and the second side is the basolateral side. 

Mammalian cell as claimed in claim 1 to 3 wherein the cell is an epithelial cell 
of the kidney, of the bowels system, of the liver or of the blood/brain barrier. 
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5. Mammalian cell as claimed in claims 1 to 4 which is immortalized. 



Mammalian cell as claimed in claims 1 to 5 which is a recombinant cell. 
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Mammalian cell as claimed in claims 1 to 6 which is a LLC-PK1 cell harboring 
a vector for expressing hNTCP protein and a vector for expressing hBSEP 
protein. 
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8. Mammalian cell as claimed in claims 1 to 6 which is a MDCKII cell harboring a 
vector for expressing the hNTCP protein and a vector for expressing the 
hBSEP protein. 

9. Mammalian cell as claimed in claim 8 according to deposit DSM ACC2643. 



10. Manufacturing of a mammalian cell according to claims 1 to 9 wherein 
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a] a mammalian cell is provided; 

b] a vector is provided encompassing the coding sequence of hNTCP; 

c] a vector is provided encompassing the coding sequence of hBSEP; 

d] the mammalian cell from a] is transformed by a vector from b] and by a 
5 vector from c] either simultaneously or consecutively; 

e] a double transfectant cell from d] is identified and propagated. 

1 1 . Manufacturing of a mammalian cell as claimed in claim 1 0 wherein the 
mammalian cell from a] is an epithelial cell of the kidney, of the bowels 

10 system, of the liver or of the blood/brain barrier. 

12. Manufacturing of a mammalian cell as claimed in claims 1 0 and 1 1 wherein 
the mammalian cell from a] is immortalized. 

15 13. Manufacturing of a mammalian cell as claimed in claims 10 to 12 wherein the 
vector from b] is a polynucleotide according to Fig.9 (Seq ID No. 4). 

14. Manufacturing of a mammalian cell as claimed in claims 10 to 12 wherein the 
vector from c] is a polynucleotide according to Fig. 10 (Seq ID No. 5). 

20 

15. Manufacturing of a mammalian cell as claimed in claims 10 to 14 wherein the 
mammalian cell is build up as deposited cell DSM ACC2643. 

1 6. A monolayer of cells comprising at least two cells according to claims 1 to 9. 

17. A solid surface carrying a monolayer according to claim 1 6 wherein the 
monolayer of cells could occupy the part or the whole of the solid s.urface. 

1 8. A solid surface as claimed in claim 1 7 that is formed by a plastic. 

30 

19. A solid surface as claimed in claim 1 7 that is part of a petri dish. 
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20. A solid surface as claimed in claim 1 7 that is part of a filter-insert. 
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21. A petri dish carrying a monolayer of cells according to claim 17. 

22. A filter-insert carrying a monolayer of cells according to claim 17. 

23. A filter-insert as claimed in claim 22 wherein the membrane support is made 
of polycarbonat and/or polyester. 

24. A filter-insert as claimed in claims 22 and 23 wherein the membrane support's 
pore size is 0.4 pm. 

25. Use of a mammalian cell of claims 1 to 9 for determining a pharmacological 
profile with respect to hepatobiliary elimination and/or renal excretion and/or 

' brain resorption and/or intestinal resorption. 

26. Use of a mammalian cell as claimed in claim 25 wherein the mammalian cell 
forms part of a monolayer on a solid surface and/or on a petri dish and/or on a 
filter-insert. 

27. Mammalian cell having a first and a second side which both sides form part of 
the outer surface of such cell and which both sides are different from the 
areas of contact of such cell and which first and second side are distinguished 
from each other by their localization of opposite ends of such cell wherein the 
first side carries a functional hNTCP protein and the second side carries a 
functional hMRP2 protein. 

28. Mammalian cell as claimed in claim 27 wherein the first side is the basolateral 
side and the second side is the. apical side. 

29. Mammalian cell as claimed in claims 27 and 28 wherein the first side is the 
apical side and the second side is the basolateral side. 

30. Mammalian cell as claimed in claims 27 to 29 wherein the cell is an epithelial 
cell of the kidney, of the bowels system, of the liver or of the blood/brain 
barrier. 
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31 . Mammalian cell as claimed in claims 27 to 30 which is immortalized. 

32. Mammalian cell as claimed in claims 27 to 31 which is a recombinant cell. 

33. Mammalian cell as claimed in claims 27 to 32 which is a LLC-PK1 cell 
harboring a vector for expressing hNTCP protein and a vector for expressing 
hMRP2 protein. 

34. Mammalian cell as claimed in claims 27 to 33 which is a MDCKII cell 
harboring a vector for expressing hNTCP protein and a vector for expressing 
hMRP2 protein. 

35. Mammalian cell as claimed in claims 27 to 34 as deposited as DSM 
ACC2644. 

36. Manufacturing of a mammalian cell according to claims 27 to 35 wherein 

a] a mammalian cell is provided; 

b] a vector is provided encompassing the coding sequence of hNTPC; 

c] a vector is provided encompassing the coding sequence of hMRP2; 

d] the mammalian cell from a] is transformed by a vector from b] and by a 
i vector from c] either simultaneously or consecutively; 

' e] a double transfectant cell from d] is identified and propagated. 

37. Manufacturing of a mammalian cell as claimed in claim 36 wherein the 
mammalian cell from a] is an epithelial cell of the kidney, of the bowels 
system, of the liver or of the blood/brain barrier. 

38. Manufacturing of a mammalian cell as claimed in claims 36 and 37 wherein 
the mammalian cell from a] is immortalized. 



39. Manufacturing of a mammalian cell as claimed in claims 36 to 38 wherein the 
vector from b] is a polynucleotide according to Fig. 9 (Seq ID No. 4). 
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40. Manufacturing of a mammalian cell as claimed in claims 36 to 38 wherein the 
the vector from c] is a polynucleotide according to Fig. 1 1 (Seq ID No. 6). 

41 . Manufacturing of a mammalian cell as claimed in claims 36 to 38 wherein the 
mammalian cell is build up as deposited cell DSM ACC2644. 

42. A monolayer of cells comprising at least two cells according to claims 27 to 
35. 

43. A solid surface carrying a monolayer according to claim 42 wherein the 
monolayer of cells could occupy the part or the whole of the solid surface. 

44. A solid surface. as claimed in claim 43 that is formed by a plastic. 

45. A solid surface as claimed in claim 43 that is part of a petri dish. 

46. A solid surface as claimed in claim 43 that is part of a filter-insert. 

47. A petri dish carrying a monolayer of cells according to claims 27 to 35. 

48. A filter-insert carrying a monolayer of cells according to claims 27 to 35. 

49. A filter-insert as claimed in claim 48 wherein the membrane support is made 
of polycarbonate and/or polyester. 

50. A filter-insert as claimed in claim 48 wherein the membrane support's pore 
size is 0.4 urn. 

51 . Use of a mammalian cell of claims 27 to 35 for determining a pharmacological 
profile with respect to hepatobiliary elimination and/or renal excretion and/or 
brain resorption and/or intestinal resorption. 
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Use of a mammalian cell as claimed in claim 51 wherein the mammalian c 
forms part of a monolayer on a solid surface and/or on a petri dish and/or 
filter-insert. 
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Summary 

The invention is referring to several dual-transfected cell lines expressing human 
NTCP (Na/taurocholate Cotransporting Protein; SLC10A1) together with human 
BSEP (Bile Salt Export Pump; ABCB11) or human MRP2 (Multidrug Resistance 
Protein; ABCC2) suitable as an in vitro tool for pharmaceutical compound profiling 
particularly as a model for hepatobiliary elimination. 
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Fig. 1 
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Fig 3 
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Fig 4 
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Fig. 5 
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Fig. 6 



CAGGAGGATGGAGGCCCACAACGCGTCTGCCCCATTCAACTTCACCCTGCCACCCAACTTTGGCAAGCGCCCCACAGACC 
TGGCACTGAGCGTCATCCTGGTGTTCATGTTGTTCTTCATCATGCTCTCGCTGGGCTGCACCATGGAGTTCAGCAAGATC 
AAGGCTCACTTATGGAAGCCTAAAGGGCTGGCCATCGCCCTGGTGGCACAGTATGGCATCATGCCCCTCACGGCCTTTGT 
GCTGGGCAAGGTCTTCCGGCTGAAGAACATTGAGGCACTGGCCATCTTGGTCTGTGGCTGCTCACCTGGAGGGAACCTGT 
CCAATGTCTTCAGTCTGGCCATGAAGGGGGACATGAACCTCAGCATTGTGATGACCACCTGCTCCACCTTCTGTGCCCTT 
GGCATGATGCCTCTCCTCCTGTACATCTACTCCAGGGGGATCTATGATGGGGACCTGAAGGACAAGGTGCCCTATAAAGG 
CATCGTGATATCACTGGTCCTGGTTCTCATTCCTTGCACCATAGGGATCGTCCTCAAATCTAAACGGCCACAATACATGC 
GCTATGTCATCAAGGGAGGGATGATCATCATTCTCTTGTGCAGTGTGGCCGTCACAGTTCTCTCTGCCATCAATGTGGGG 
AAGAGCATCATGTTTGCCATGACACGACTCTTGATTGCCACCTCCTCCCTGATGCCTTCTATTGGCTTTCTGCTGGGTTA 
TGTTCTCTCTGCTCTCTTCTGCCTCAATGGACGGTGCAGACGCACTGTCAGCATGGAGACTGGATGCCAAAATGTCCAAC 
TCTGTTCCACCATCCTCAATGTGGCCTTTCCACCTGAAGTCATTGGACCACTTTTCTTCTTTCCCCTCCTCTACATGATT 
TTCCAGCTTGGAGAAGGGCTTCTCCTCATTGCCATATTTTGGTGCTATGAGAAATTCAAGACTCCCAAGGATAAAACAAA 
AATGATCTACACAGCTGCCACAACTGAAGAAACAATTCCAGGAGCTCTGGGAAATGGCACCTACAAAGGGGAGGACTGCT 
CCCCTTGCACAGCCTAGCCCT 
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ATGTCTGACTCAGTAATTCTTCGAAGTATAAAGAAATTTGGAGAGGAGAATGATGGTTTTGAGTCAGATAAATCATATAA 

TAATGAT^GAAATCAAGGTTACAAGATGAGAAGAAAGGTGATGGCGTTAGAGTTGGCTTCTTTCAATTGTTTCGGTTTT 

CTTCATCAACTGACATTTGGCTGATGTTTGTGGGAAGTTTGTGTGCATTTCTCCATGGAATAGCCCAGCCAGGCGTGCTA 

C T CATTTT TGGCAC AATGAC AGATGTT TTTATTGACT ACGACGTTGAGT TACAAGAACT C CAGAT T C C AGGAAAAGC ATG 

TGTGAATAACACCATTGTATGGACTAACAGTTCCCTCAACCAGAACATGACAAATGGAACACGTTGTGGGTTGCTGAACA 

T CGAGAGCGAAATGAT C AAAT TT GC C AGT TAG TAT G CTGGAATTGCTGT CG C AGTAC TT AT C ACAGGAT ATATTC AAAT A 

TG CTTT TGGGT C ATTGC CG C AGCT C GT CAGAT AC AGAAAATGAGAAAATTTTACTTTAGGAGAATAATGAGAATGGAAAT 

AGGGTGGTTTGACTGCAATTCAGTGGGGGAGCTGAATACAAGATTCTCTGATGATATTAATAAAATCAATGATGCCATAG 

CTGACCAAATGGCCCTTTTCATTCAGCGCATGACCTCGACCATCTGTGGTTTCCTGTTGGGATTTTTCAGGGGTTGGAAA 

CTGACCTTGGTTATTATTTCTGTCAGCCCTCTCATTGGGATTGGAGCAGCCACCATTGGTCTGAGTGTGTCCAAGTTTAC 

GGACTATGAGCTGAAGGCCTATGCCAAAGCAGGGGTGGTGGCTGATGAAGTCATTTCATCAATGAGAACAGTGGCTGCTT 

TTGGTGGTGAGAAAAGAGAGGTTGAAAGGTATGAGAAAAATCTTGTGTTCGCCCAGCGTTGGGGAATTAGAAAAGGAATA 

GTGATGGGATTCTTTACTGGATTCGTGTGGTGTCTCATCTTTTTGTGTTATGCACTGGCCTTCTGGTACGGCTCCACACT 

TGTCCTGGATGAAGGAGAATATACACCAGGAACCCTTGTCCAGATTTTCCTCAGTGTCATAGTAGGAGCTTTAAATCTTG 

GCAATGCCTCTCCTTGTTTGGAAGCCTTTGCAACTGGACGTGCAGCAGCCACCAGCATTTTTGAGACAATAGACAGGAAA 

CCCATCATTGACTGCATGTCAGAAGATGGTTACAAGTTGGATCGAATCAAGGGTGAAATTGAATTCCATAATGTGACCTT 

CCATTATCCTTCCAGACCAGAGGTGAAGATTCTAAATGACCTCAACATGGCCATTAAACCAGGGGAAATGACAGCTCTGG 

TAGGACCCAGTGGAGCTGGAAAAAGTACAGCACTGCAACTCATTCAGCGATTCTATGACCCCTGTGAAGGAATGGTGACC 

GTGGATGGCCATGACATTCGCTCTCTTAACATTCAGTGGCTTAGAGATCAGATTGGGATAGTGGAGCAAGAGCCAGTTCT 

GTTCTCTACCACCATTGCAGAAAATATTCGCTATGGCAGAGAAGATGCAACAATGGAAGACATAGTCCAAGCTGCCAAGG 

AGGCCAATGCCTACAACTTCATCATGGACCTGCCACAGCAATTTGACACCCTTGTTGGAGAAGGAGGAGGCCAGATGAGT 

GGTGGCCAGAAACAAAGGGTAGCTATCGCCAGAGCCCTCATCCGAAATCCCAAGATTCTGCTTTTGGACATGGCCACCTC 

AGCT C TGGACAATGAGAGTGAAGC CATGGTGC AAGAAGTGCTGAGT AAGATT CAG C ATGGGCAC AC AAT C ATTT C AGT TG 
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CTCATCGCTTGTCTACGGTCAGAGCTGCAGATACCATCATTGGTTTTGAACATGGCACTGCAGTGGAAAGAGGGACCCAT 
GAAGAATTACTGGAAAGGAAAGGTGTTTACTTCACTCTAGTGACTTTGCAAAGCCAGGGAAATCAAGCTCTTAATGAAGA 
GGACATAAAGGATGCAACTGAAGATGACATGCTTGCGAGGACCTTTAGCAGAGGGAGCTACCAGGATAGTTTAAGGGCTT 
CCATCCGGCAACGCTCCAAGTCTCAGCTTTCTTACCTGGTGCACGAACCTCCATTAGCTGTTGTAGATCATAAGTCTACC 
5 TATGAAGAAGATAGAAAGGACAAGGACATTCCTGTGCAGGAAGAAGTTGAACCTGCCCCAGTTAGGAGGATTCTGAAATT 
CAGTGCTCCAGAATGGCCCTACATGCTGGTAGGGTCTGTGGGTGCAGCTGTGAACGGGACAGTCACACCCTTGTATGCCT 
TTTT AT TC AGC C AGATT CTTGGGACTTT TTCAATT CCTGATAAAGAGG AAC AAAGGT C AC AGAT CAATGGTGTGTGCCTA 
CTTTTTGTAGCAATGGGCTGTGTATCTCTTTTCACCCAATTTCTACAGGGATATGCCTTTGCTAAATCTGGGGAGCTCCT 
AACAAAAAGGCTACGTAAATTTGGTTTGAGGGCAATGCTGGGGCAAGATATTGCCTGGTTTGATGACCTCAGAAATAGCC 

10 CTGGAGCATTGACAACAAGACTTGCTACAGATGCTTCCCAAGTTCAAGGGGCTGCCGGCTCTCAGATCGGGATGATAGTC 
AATTCCTTCACTAACGTCACTGTGGCCATGATCATTGCCTTCTCCTTTAGCTGGAAGCTGAGCCTGGTCATCTTGTGCTT 
CTTCCCCTTCTTGGCTTTATCAGGAGCCACACAGACCAGGATGTTGACAGGATTTGCCTCTCGAGATAAGCAGGCCCTGG 
AGATGGTGGGACAGATTACAAATGAAGCCCTCAGTAACATCCGCACTGTTGCTGGAATTGGAAAGGAGAGGCGGTTCATT 
G AAGC ACTTGAG ACTGAGCTGGAGAAGCC C TT C AAGACAGC CATTC AGAAAGC C AAT AT TTACGG ATTCTGCTTTGC C T T 

15 TGCCCAGTGCATCATGTTTATTGCGAATTCTGCTTCCTACAGATATGGAGGTTACTTAATCTCCAATGAGGGGCTCCATT 

• TCAGCTATGTGTTCAGGGTGATCTCTGCAGTTGTACTGAGTGCAACAGCTCTTGGAAGAGCCTTCTCTTAGACCCCAAGT 
TATGCAAAAGCTAAAATATCAGCTGCACGCTTTTTTCAACTGCTGGACCGACAACCCCCAATCAGTGTATACAATACTGC 
AGGTGAAAAATGGGACAACTTCCAGGGGAAGATTGATTTTGTTGATTGTAAATTTACATATCCTTCTCGACCTGACTCGC 
AAGTTCTGAATGGTCTCTCAGTGTCGATTAGTCCAGGGCAGACACTGGCGTTTGTTGGGAGCAGTGGATGTGGCAAAAGC 
20 ACT AGC ATT C AGCTGTTGGAACGTTTCTATGAT C CTGAT CAAGGGAAGGTGAT GAT AGATGGTC ATGACAGCAAAAAAGT 
AAATGTCCAGTTCCTCCGCTCAAACATTGGAATTGTTTCCCAGGAACCAGTGTTGTTTGCCTGTAGCATAATGGACAATA 
T C AAGTATGGAGACAACAC CAAAGAAATT CC CAT GGAAAGAGT CATAGC AGCTG C AAAACAGG CTCAGCTG CATGATTTT 
GT C ATGT CACT CC CAGAGAAATATGAAAC TAACGTT GGGT C C C AGGGGT CT C AAC T CT CTAGAGGGG AGAAACAACGCAT 
TGCTATTGCTCGGGCCATTGTACGAGATCCTAAAATCTTGCTACTAGATGAAGCCACTTCTGCCTTAGACACAGAAAGTG 
25 AAAAGACGGTGCAGGTTGCTCTAGACAAAGC CAGAGAGGGT CGGAC CTG CATTGT C ATTGC C CATCGCT TGT CCAC CAT C 
CAGAACGCGGATATCATTGCTGTCATGGCACAGGGGGTGGTGATTGAAAAGGGGACCCATGAAGAACTGATGGCCCAAAA 
AGGAGC CTACT AC AAA.CTAGTC ACCACTGGAT C CC CCAT CAGTTGA 

Fig. 8 

GCCACTTTGTTTTGAGCAAACTGTTCTGGTGTGGATTCCCTTGGGCTTCCTATGGCTCCTGGCCCCCTGGCAGCTTCTCC 
ACGTGTATAAATCCAGGACCAAGAGATCCTCTACCACCAAACTCTATCTTGCTAAGCAGGTATTCGTTGGTTTTCTTCTT 
ATTCTAGCAGCCATAGAGCTGGCCCTTGTACTCACAGAAGACTCTGGACAAGCCACAGTCCCTGCTGTTCGATATACCAA 

35 TCCAAGCCTCTACCTAGGCACATGGCTCCTGGTTTTGCTGATCCAATACAGCAGACAATGGTGTGTACAGAAAAACTCCT 
GGTTCCTGTCCCTATTCTGGATTCTCTCGATACTCTGTGGCACTTTCCAATTTCAGACTCTGATCCGGACACTCTTACAG 
GGTGACAATTCTAATCTAGCCTACTCCTGCCTGTTCTTCATCTCCTACGGATTCCAGATCCTGATCCTGATCTTTTCAGC 
ATTTTCAGAAAATAATGAGTCATCAAATAATCCATCATCCATAGCTTCATTCCTGAGTAGCATTACCTACAGCTGGTATG 
AC AG CAT C ATTCTGAAAGGCTACAAGCGTCCT CTGACACTCGAGGATGT CTGGG AAGTTGATGAAGAGATGAAAAC CAAG 

40 AC ATT AGTGAGCAAGTTTGAAACGCACATGAAGAGAGAGCTG C AG AAAGC C AGG CGGGCACT CCAGAGACGGCAGGAGAA 
GAGCTCCCAGCAGAACTCTGGAGCCAGGCTGCCTGGCTTGAACAAGAATCAGAGTCAAAGCCAAGATGCCCTTGTCCTGG 
AAGATGTT GAAAAGAAAAAAAAGAAGTCTGGGAC CAAAAAAGAT GTT C C AAAAT C CTGGTTGATGAAGGCTCTGTTCAAA 
ACTTTCTACATGGTGCTCCTGAAATCATTCCTACTGAAGCTAGTGAATGACATCTTCACGTTTGTGAGTCCTCAGCTGCT 
GAAATTGCTGATCTCCTTTGCAAGTGACCGTGACACATATTTGTGGATTGGATATCTCTGTGCAATCCTCTTATTCACTG 
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CGGCTCTCATTCAGTCTTTCTGCCTTCAGTGTTATTTCCAACTGTGCTTCAAGCTGGGTGTAAAAGTACGGACAGCTATC 
ATGGCTTCTGTATATAAGAAGGCATTGACCCTATCCAACTTGGCCAGGAAGGAGTACACCGTTGGAGAAACAGTGAACCT 
GATGTCTGTGGATGCCCAGAAGCTCATGGATGTGACCAACTTCATGCACATGCTGTGGTCAAGTGTTCTACAGATTGTCT 
TATCTATCTTCTTCCTATGGAGAGAGTTGGGACCCTCAGTCTTAGCAGGTGTTGGGGTGATGGTGCTTGTAATCCCAATT 
5 AATGCGATACTGTCCACCAAGAGTAAGACCATTCAGGTCAAAAATATGAAGAATAAAGACAAACGTTTAAAGATCATGAA 
TGAGATTCTTAGTGGAATCAAGATCCTGAAATATTTTGCCTGGGAACCTTCATTCAGAGACCAAGTACAAAACCTCCGGA 
AGAAAGAGCTCAAGAACCTGCTGGCCTTTAGTCAACTACAGTGTGTAGTAATATTCGTCTTCCAGTTAACTCCAGTCCTG 
GTATCTGTGGTCACATTTTCTGTTTATGTCCTGGTGGATAGCAACAATATTTTGGATGCACAAAAGGCCTTCACCTCCAT 
TACCCTCTTCAATATCCTGCGCTTTCCCCTGAGCATGCTTCCCATGATGATCTCCTCCATGCTCCAGGCCAGTGTTTCCA 

10 CAGAGCGGCTAGAGAAGTACTTGGGAGGGGATGACTTGGACACATCTGCCATTCGACATGACTGCAATTTTGACAAAGCC 
ATGCAGTTTTCTGAGGCCTCCTTTACCTGGGAACATGATTCGGAAGCCACAGTCCGAGATGTGAACCTGGACATTATGGC 
AGGCCAACTTGTGGCTGTGATAGGCCCTGTCGGCTCTGGGAAATCCTCCTTGATATCAGCCATGCTGGGAGAAATGGAAA 
ATGTCCACGGGCACATCACCATCAAGGGCACCACTGCCTATGTCCCACAGCAGTCCTGGATTCAGAATGGCACCATAAAG 
GACAACATCCTTTTTGGAACAGAGTTTAATGAAAAGAGGTACCAGCAAGTACTGGAGGCCTGTGCTCTCCTCCCAGACTT 

15 GGAAATGCTGCCTGGAGGAGATTTGGCTGAGATTGGAGAGAAGGGTATAAATCTTAGTGGGGGTCAGAAGCAGCGGATCA 

•GCCTGGCCAGAGCTACCTACCAAAATTTAGACATCTATCTTCTAGATGACCCCCTGTCTGCAGTGGATGCTCATGTAGGA 
AAACATATTTTTAATAAGGTCTTGGGCCCCAATGGCCTGTTGAAAGGCAAGACTCGACTCTTGGTTACACATAGCATGCA 
CTTTCTTCCTCAAGTGGATGAGATTGTAGTTCTGGGGAATGGAACAATTGTAGAGAAAGGATCCTACAGTGCTCTCCTGG 
CCAAAAAAGGAGAGTTTGCTAAGAATCTGAAGACATTTCTAAGACATACAGGCCCTGAAGAGGAAGCCACAGTCCATGAT 
20 GGCAGTGAAGAAGAAGACGATGACTATGGGCTGATATCCAGTGTGGAAGAGATCCCCGAAGATGCAGCCTCCATAACCAT 
GAGAAGAGAGAACAGCTTTCGTCGAACACTTAGCCGCAGTTCTAGGTCCAATGGCAGGCATCTGAAGTCCCTGAGAAACT 
CCTTGAAAACTCGGAATGTGAATAGCCTGAAGGAAGACGAAGAACTAGTGAAAGGACAAAAACTAATTAAGAAGGAATTC 
ATAGAAACTGGAAAGGTGAAGT T CT C CAT CTAC CTGGAGTAC CTACAAG C AATAGGAT TGT TTTCGATATT CT T CAT CAT 
CCTTGCGTTTGTGATGAATTCTGTGGCTTTTATTGGATCCAACCTCTGGCTCAGTGCTTGGACCAGTGACTCTAAAATCT 
25 TCAATAGCACCGACTATCCAGCATCTCAGAGGGACATGAGAGTTGGAGTCTACGGAGCTCTG'GGATTAGCCCAAGGTATA 
TTTGTGTTCATAGC^CATTTCTGGAGTGCCTTTGGTTTCGTCCATGCATC^AATATCTTGCACAAGCAACTGCTGAACAA 
TAT C CT T C GAGCAC CT ATGAGAT T T TTTGAC AC AACAC C C ACAGGC CGGAT TGTGAACAGGTTTGC CGG CGAT AT TT C CA 
CAGTGGATGACACCCTGCCTCAGTCCTTGCGCAGCTGGATTACATGCTTCCTGGGGATAATCAGCACCCTTGTCATGATC 
TGCATGGCCACTCCTGTCTTCACCATCATCGTCATTCCTCTTGGCATTATTTATGTATCTGTTCAGATGTTTTATGTGTC 
30 TACCTCCCGCCAGCTGAGGCGTCTQGACTCTGTCACCAGGTCCCCAATCTACTCTCACTTCAGCGAGACCGTATCAGGTT 

• TGCCAGTTATCCGTGCCTTTGAGCACCAGCAGCGATTTCTGAAACACAATGAGGTGAGGATTGACACCAACCAGAAATGT 
GTCTTTTCCTGGATCACCTCCAACAGGTGGCTTGCAATTCGCCTGGAGCTGGTTGGGAACCTGACTGTCTTCTTTTCAGC 
CTTGATGATGGTTATTTATAGAGATACCCTAAGTGGGGACACTGTTGGCTTTGTTCTGTCCAATGCACTCAATATCACAC 
AAAC C C TGAACTGG C TGGTGAGGATGACAT C AGAAATAGAGAC C AACATT GTGGC TGTTGAGCGAATAACTGAGTACACA 
35 AAAGTGGAAAATGAGGCACCCTGGGTGACTGATAAGAGGCCTCCGCCAGATTGGCCCAGCAAAGGCAAGATCCAGTTTAA 
CAACTACCAAGTGCGGTACCGACCTGAGCTGGATCTGGTCCTCAGAGGGATCACTTGTGACATTGGTAGCATGGAGAAGA 
TTGGTGTGGTGGGCAGGACAGGAGCTGGAAAGTCATCCCTCACAAACTGCCTCTTCAGAATCTTAGAGGCTGCCGGTGGT 
CAGATTATCATTGATGGAGTAGATATTGCTTCCATTGGGCTCCACGACCTCCGAGAGAAGCTGACCATCATCCCCCAGGA 
CCCCATCCTGTTCTCTGGAAGCCTGAGGATGAATCTCGACCCTTTCAACAACTACTCAGATGAGGAGATTTGGAAGGCCT 
40 TGGAGCTGGCTCACCTCAAGTCTTTTGTGGCCAGCCTGCAACTTGGGTTATCCCACGAAGTGACAGAGGCTGGTGGCAAC 
CTGAGCATAGGCCAGAGGCAGCTGCTGTGCCTGGGCAGGGCTCTGCTTCGGAAATCCAAGATCCTGGTCCTGGATGAGGC 
CACTGCTGCGGTGGATCTAGAGACAGACAACCTCATTCAGACGACCATCCAAAACGAGTTCGCCCACTGCACAGTGATCA 
CCATCGCCCACAGGCTGCACACCATCATGGACAGTGACAAGGTAATGGTCCTAGACAACGGGAAGATTATAGAGTGCGGC 
AGCCCTGAAGAACTGCTACAAATCCCTGGACCCTTTTACTTTATGGCTAAGGAAGCTGGCATTGAGAATGTGAACAGCAC 
45 AAAATT C TAG 
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Fig. 9 

GACGGATCGGGAGATCTCCCGATCCCCTATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGTAT 
5 CTGCTCCCTGCTTGTGTGTTGGAGGTCGCTGAGTAGTGCGCGAGCAAAATTTAAGCTACAACAAGGCAAGGCTTGACCGA 
CAATTGCATGAAGAATCTGCTTAGGGTTAGGCGTTTTGCGCTGCTTCGCGATGTACGGGCCAGATATACGCGTTGACATT 
GATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAA 
CTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGT 
AACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGT 
10 ATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTA 
TGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAA 
TGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACC 
AAAATCAACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAG 
GTCTATATAAGCAGAGCTCTCTGGCTAACTAGAGAACCCACTGCTTACTGGCTTATCGAAATTAATACGACTCACTATAG 
GGAGACCCAAGCTGGCTAGCGTTTAAACTTAAGCTTGGTACCGAGCTCGGATCCCAGGAGGATGGAGGCCCACAACGCGT 
icTGCCCCATTCAACTTCACCCTGCCACCCAACTTTGGCAAGCGCCCCACAGACCTGGCACTGAGCGTCATCCTGGTGTTC 
ATGTTGTTCTTCATCATGCTCTCGCTGGGCTGCACCATGGAGTTCAGCAAGATCAAGGCTCACTTATGGAAGCCTAAAGG 
GCTGGCCATCGCCCTGGTGGCACAGTATGGCATCATGCCCCTCACGGCCTTTGTGCTGGGCAAGGTCTTCCGGCTGAAGA 
ACATTGAGGCACTGGCCATCTTGGTCTGTGGCTGCTCACCTGGAGGGAACCTGTCCAATGTCTTCAGTCTGGCCATGAAG 
20 GGGGACATGAACCTCAGCATTGTGATGACCACCTGCTCCACCTTCTGTGCCCTTGGCATGATGCCTCTCCTCCTGTACAT 
CTACTCCAGGGGGATCTATGATGGGGACCTGAAGGACAAGGTGCCCTATAAAGGCATCGTGATATCACTGGTCCTGGTTC 
TCATTCCTTGCACCATAGGGATCGTCCTCAAATCTAAACGGCCACAATACATGCGCTATGTCATCAAGGGAGGGATGATC 
ATCATTCTCTTGTGCAGTGTGGCCGTCACAGTTCTCTCTGCCATCAATGTGGGGAAGAGCATCATGTTTGCCATGACACC 
ACTCTTGATTGCCACCTCCTCCCTGATGCCTTCTATTGGCTTTCTGCTGGGTTATGTTCTCTCTGCTCTCTTCTGCCTCA 
25 ATGGACGGTGCAGACGCACTGTCAGCATGGAGACTGGATGCCAAAATGTCCAACTCTGTTCCACCATCCTCAATGTGGCC 
TTTCCACCTGAAGTCATTGGACCACTTTTCTTCTTTCCCCTCCTCTACATGATTTTCCAGCTTGGAGAAGGGCTTCTCCT 
CATTGCCATATTTTGGTGCTATGAGAAATTCAAGACTCCCAAGGATAAAACAAAAATGATCTACACAGCTGCCACAACTG 
AAGAAACAATTCCAGGAGCTCTGGGAAATGGCACCTACAAAGGGGAGGACTGCTCCCCTTGCACAGCCTAGCCCTTCTAG 
AGGGCCCGTTTAAACCCGCTGATCAGCCTCGACTGTGCCTTCTAGTTGCCAGCCATCTGTTGTTTGCCCCTCCCCCGTGC 
CTTCCTTGACCCTGGAAGGTGCCACTCCCACTGTCCTTTCCTAATAAAATGAGGAAATTGCATCGCATTGTCTGAGTAGG 

'tgtcattctattctggggggtggggtggggcaggacagcaagggggaggattgggaagacaatagcaggcatgctgggga 

TGCGGTGGGCTCTATGGCTTCTGAGGCGGAAAGAACCAGCTGGGGCTCTAGGGGGTATCCCCACGCGCCCTGTAGCGGCG 
CATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTXTCGCT 
TTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATT 
35 TAGTGCTTTACGGCACCTCGACCCCAAAT^AACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGG 
TTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAACCCTATC 
TCGGTCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAACAAAAATT 
TAACGCGAATTAATTCTGTGGAATGTGTGTCAGTTAGGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCAGAAGTATGCA 
AAGCATGCATCTCAATTAGTCAGCAACCAGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCAGAAGTATGCAAAGCATGC 
40 ATCTCAATTAGTCAGCAACCATAGTCCCGCCCCTAACTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCT 
CCGCCCCATGGCTGACTAATTTTTTTTATTTATGCAGAGGCCGAGGCCGCCTCTGCCTCTGAGCTATTCCAGAAGTAGTG 
AGGAGGCTTTTTTGGAGGCCTAGGCTTTTGCAAAAAGCTCCCGGGAGCTTGTATATCCATTTTCGGATCTGATCAAGAGA 
CAGGATGAGGATCGTTTCGCATGATTGAACAAGATGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTC 
GGCTATGACTGGGCACAACAGACAATCGGCTGCTCTGATGCCGCCGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCT 
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TTTTGTCAAGACCGACCTGTCCGGTGCCCTGAATGAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGG 
GCGTTCCTTGCGCAGCTGTGCTCGACGTTGTCACTGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAG 
GATCTCCTGTCATCTCACCTTGCTCCTGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGA 
TCCGGCTACCTGCCCATTCGACCACCAAGCGAAACATCGCATCGAGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCG 
5 ATCAGGATGATCTGGACGAAGAGCATCAGGGGCTCGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGAC 
GGCGAGGATCTCGTCGTGACCCATGGCGATGCCTGCTTGCCGAATATCATGGTGGAAAATGGCCGCTTTTCTGGATTCAT 
CGACTGTGGCCGGCTGGGTGTGGCGGACCGCTATCAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCG 
GCGAATGGGCTGACCGCTTCCTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCTATCGCCTTCTT 
GACGAGTTCTTCTGAGCGGGACTCTGGGGTTCGAAATGACCGACCAAGCGACGCCCAACCTGCCATCACGAGATTTCGAT 

10 TCCACCGCCGCCTTCTATGAAAGGTTGGGCTTCGGAATCGTTTTCCGGGACGCCGGCTGGATGATCCTCCAGCGCGGGGA 
TCTCATGCTGGAGTTCTTCGCCCACCCCAACTTGTTTATTGCAGCTTATAATGGTTACAAATAAAGCAATAGCATCACAA 
ATTTCACAAATAAAGCATTTTTTTCACTGCATTCTAGTTGTGGTTTGTCCAAACTCATCAATGTATCTTATCATGTCTGT 
ATACCGTCGACCTCTAGCTAGAGCTTGGCGTAATCATGGTCATAGCTGTTTCCTGTGTGAAATTGTTATCCGCTCACAAT 
TCCACACAACATACGAGCCGGAAGCATAAAGTGTAAAGCCTGGGGTGCCTAATGAGTGAGCTAACTCACATTAATTGCGT 

15 TGCGCTCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCCAGCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGC 
^^^^^GGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATC 
mT j«AGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGC 
^^^^AAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAAT 
CGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCG 

20 CTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCT 
CACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGAC 
CGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCCACTGG 
TAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAA 
GAACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAA 

25 ACCACCGCTGGTAGCGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGAT 
CTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCT 
T CACCTAGAT C CTTTT AAATT AAAAATGAAGT TTTAAATCAAT CTAAAGTATATATGAGT AAAC TTGGT CTGAC AGTTAC 
CAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTA 
GATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAG 

30 ATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCT 

• ATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCAT 
CGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCA 
TGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATG 
GTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAA 
35 GTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCACATAGCA 
GAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGT 
TCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGG 
AAGGC AAAATGC CGC AAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACT CAT AC T CT T C C TT TTT C AATAT TATT 
GAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCG 
40 CGCACATTTCCCCGAAAAGTGCCACCTGACGTC 
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Fig. 10 

GACGGATCGGGAGATCTCCCGATCCCCTATGGTCGACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGTAT 
CTGCTCCCTGCTTGTGTGTTGGAGGTCGCTGAGTAGTGCGCGAGCAAAATTTAAGCTACAACAAGGCAAGGCTTGACCGA 
5 CAATTGCATGAAGAATCTGCTTAGGGTTAGGCGTTTTGCGCTGCTTCGCGATGTACGGGCCAGATATACGCGTTGACATT 
GATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAA 
CTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGT 
AACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGACTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGT 
ATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTA 
10 TGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAA 
TGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACC 
AAAATCAACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAG 
GTCTATATAAGCAGAGCTCTCTGGCTAACTAGAGAACCCACTGCTTACTGGCTTATCGAAATTAATACGACTCACTATAG 
GGAGACCCAAGCTGGCTAGCGTTTA7^ACTTAAGCTATCACAAGTTTGTACAAAAAAGCAGGCTTAGGAATGTCTGACTCA 
1<Z GTAATTCTTCGAAGTATAAAGAAATTTGGAGAGGAGAATGATGGTTTTGAGTCAGATAAATCATATAATAATGATAAGAA 
■^^ATCAAGGTTACAAGATGAGAAGAAAGGTGATGGCGTTAGAGTTGGCTTCTTTCAATTGTTTCGGTTTTCTTCATCAACTG 
^^ACATTTGGCTGATGTTTGTGGGAAGTTTGTGTGCATTTCTCCATGGAATAGCCCAGCCAGGCGTGCTACTCATTTTTGGC 
ACAATGACAGATGTTTTTATTGACTACGACGTTGAGTTACAAGAACTCCAGATTCCAGGAAAAGCATGTGTGAATAACAC 
CATTGTATGGACTAACAGTTCCCTCAACCAGAACATGACAAATGGAACACGTTGTGGGTTGCTGAACATCGAGAGCGAAA 
20 TGATCAAATTTGCCAGTTACTATGCTGGAATTGCTGTCGCAGTACTTATCACAGGATATATTCAAATATGCTTTTGGGTC 
ATTGCCGCAGCTCGTCAGATACAGAAAATGAGAAAATTTTACTTTAGGAGAATAATGAGAATGGAAATAGGGTGGTTTGA 
CTGCAATTCAGTGGGGGAGCTGAATACAAGATTCTCTGATGATATTAATAAAATCAATGATGCCATAGCTGACCAAATGG 
CCCTTTTCATTCAGCGCATGACCTCGACCATCTGTGGTTTCCTGTTGGGATTTTTCAGGGGTTGGAAACTGACCTTGGTT 
ATTATTTCTGTCAGCCCTCTCATTGGGATTGGAGCAGCCACCATTGGTCTGAGTGTGTCCAAGTTTACGGACTATGAGCT 
25 GAAGGCCTATGCCAAAGCAGGGGTGGTGGCTGATGAAGTCATTTCATCAATGAGAACAGTGGCTGCTTTTGGTGGTGAGA 
AAAGAGAGGTTGAAAGGTATGAGAAAAATCTTGTGTTCGCCCAGCGTTGGGGAATTAGAAAAGGAATAGTGATGGGATTC 
TTTACTGGATTCGTGTGGTGTCTCATCTTTTTGTGTTATGCACTGGCCTTCTGGTACGGCTCCACACTTGTCCTGGATGA 
AGGAGAATATACACCAGGAACCCTTGTCCAGATTTTCCTCAGTGTCATAGTAGGAGCTTTAAATCTTGGCAATGCCTCTC 
CTTGTTTGGAAGCCTTTGCAACTGGACGTGCAGCAGCCACCAGCATTTTTGAGACAATAGACAGGAAACCCATCATTGAC 
TGCATGTCAGAAGATGGTTACAAGTTGGATCGAATCAAGGGTGAARTTGAATTCCATAATGTGACCTTCCATTATCCTTC 
I CAGACCAGAGGTGAAGATTCTAAATGACCTCAACATGGCCATTAAACCAGGGGAAATGACAGCTCTGGTAGGACCCAGTG 
GAGCTGGAAAAAGTACAGCACTGCAACTCATTCAGCGATTCTATGACCCCTGTGAAGGAATGGTGACCGTGGATGGCCAT 
GACATTCGCTCTCTTAACATTCAGTGGCTTAGAGATCAGATTGGGATAGTGGAGCAAGAGCCAGTTCTGTTCTCTACCAC 
CATTGCAGAAAATATTCGCTATGGCAGAGAAGATGCAACAATGGAAGACATAGTCCAAGCTGCCAAGGAGGCCAATGCCT 
35 ACAACTTCATCATGGACCTGCCACAGCAATTTGACACCCTTGTTGGAGAAGGAGGAGGCCAGATGAGTGGTGGCCAGAAA 
CAAAGGGTAGCTATCGCCAGAGCCCTCATCCGAAATCCCAAGATTCTGCTTTTGGACATGGCCACCTCAGCTCTGGACAA 
TGAGAGTGAAGCCATGGTGCAAGAAGTGCTGAGTAAGATTCAGCATGGGCACACAATCATTTCAGTTGCTCATCGCTTGT 
CTACGGTCAGAGCTGCAGATACCATCATTGGTTTTGAACATGGCACTGCAGTGGAAAGAGGGACCCATGAAGAATTACTG 
GAAAGGAAAGGTGTTTACTTCACTCTAGTGACTTTGCAAAGCCAGGGAAATCAAGCTCTTAATGAAGAGGACATAAAGGA 
40 TGCAACTGAAGATGACATGCTtGCGAGGACCTTTAGCAGAGGGAGCTACCAGGATAGTTTAAGGGCTTCCATCCGGCAAC 
GCTCCAAGTCTCAGCTTTCTTACCTGGTGCACGAACCTCCATTAGCTGTTGTAGATCATAAGTCTACCTATGAAGAAGAT 
AGAAAGGACAAGGACATTCCTGTGCAGGAAGAAGTTGAACCTGCCCCAGTTAGGAGGATTCTGAAATTCAGTGCTCCAGA 
ATGGCCCTACATGCTGGTAGGGTCTGTGGGTGCAGCTGTGAACGGGACAGTCACACCCTTGTATGCCTTTTTATTCAGCC 
AGATTCTTGGGACTTTTTCAATTCCTGATAAAGAGGAACAAAGGTCACAGATCAATGGTGTGTGCCTACTTTTTGTAGCA 
45 ATGGGCTGTGTATCTCTTTTCACCCAATTTCTACAGGGATATGCCTTTGCTAAATCTGGGGAGCTCCTAACAAAAAGGCT 
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ACGTAAATTTGGTTTCAGGGCAATGCTGGGGCAAGATATTGCCTGGTTTGATGACCTCAGAAATAGCCCTGGAGCATTGA 
CAACAAGACTTGCTACAGATGCTTCCCAAGTTCAAGGGGCTGGCGGCTCTCAGATCGGGATGATAGTCAATTCCTTCACT 
AACGTCACTGTGGCCATGATCATTGCCTTCTCCTTTAGCTGGAAGCTGAGCCTGGTCATCTTGTGCTTCTTCCCCTTCTT 
GGCTTTATCAGGAGCCACACAGACCAGGATGTTGACAGGATTTGCCTCTCGAGATAAGCAGGCCCTGGAGATGGTGGGAC 
5 AGATTACAAATGAAGCCCTCAGTAACATCCGCACTGTTGCTGGAATTGGAAAGGAGAGGCGGTTCATTGAAGCACTTGAG 
ACTGAGCTGGAGAAGCCCTTCAAGACAGCCATTCAGAAAGCCAATATTTACGGATTCTGCTTTGCCTTTGCCCAGTGCAT 
CATGTTTATTGCGAATTCTGCTTCCTACAGATATGGAGGTTACTTAATCTCCAATGAGGGGCTCCATTTCAGCTATGTGT 
TCAGGGTGATCTCTGCAGTTGTACTGAGTGCAACAGCTCTTGGAAGAGCCTTCTCTTACACCCCAAGTTATGCAAAAGCT 
AAAATATCAGCTGCACGCTTTTTTCAACTGCTGGACCGACAACCCCCAATCAGTGTATACAATACTGCAGGTGAAAAATG 
10 GGACAACTTCCAGGGGAAGATTGATTTTGTTGATTGTAAATTTACATATCCTTCTCGACCTGACTCGCAAGTTCTGAATG 
GTCTCTCAGTGTCGATTAGTCCAGGGCAGACACTGGCGTTTGTTGGGAGCAGTGGATGTGGCAAAAGCACTAGCATTCAG 
CTGTTGGAACGTTTCTATGATCCTGATCAAGGGAAGGTGATGATAGATGGTCATGACAGCAAAAAAGTAAATGTCCAGTT 
CCTCCGCTCAAACATTGGAATTGTTTCCCAGGAACCAGTGTTGTTTGCCTGTAGCATAATGGACAATATCAAGTATGGAG 
ACAACACCAAAGAAATTCCCATGGAAAGAGTCATAGCAGCTGCAAAACAGGCTCAGCTGCATGATTTTGTCATGTCACTC 
15 CCAGAGAAATATGAAACTAACGTTGGGTCCCAGGGGTCTCAACTCTCTAGAGGGGAGAAACAACGCATTGCTATTGCTCG 
jgfc^GGCCATTGTACGAGATCCTAAAATCTTGCTACTAGATGAAGCCACTTCTGCCTTAGACACAGAAAGTGAAAAGACGGTGC 
fl J«AGGTTGCTCTAGACAAAGCCAGAGAGGGTCGGACCTGCATTGTCATTGCCCATCGCTTGTCCACCATCCAGAACGCGGAT 
^^^ATCATTGCTGTCATGGCACAGGGGGTGGTGATTGAAAAGGGGACCCATGAAGAACTGATGGCCCAAAAAGGAGCCTACTA 
CAAACTAGTCACCACTGGATCCCCCATCAGTTGAGACCCAGCTTTCTTGTACAAAGTGGTGATTGGTACCGAGCTCGGAT 
20 CCACTAGTCCAGTGTGGTGGAATTCTGCAGATATCCAGCACAGTGGCGGCCGCTCGAGTCTAGAGGGCCCGTTTAAACCC 
GCTGATCAGCCTCGACTGTGCCTTCTAGTTGCCAGCCATCTGTTGTTTGCCCCTCCCCCGTGCCTTCCTTGACCCTGGAA 
GGTGCCACTCCCACTGTCCTTTCCTAATAAAATGAGGAAATTGCATCGCATTGTCTGAGTAGGTGTCATTCTATTCTGGG 
GGGTGGGGTGGGGCAGGACAGCAAGGGGGAGGATTGGGAAGACAATAGCAGGCATGCTGGGGATGCGGTGGGCTCTATGG 
CTTCTGAGGCGGAAAGAACCAGCTGGGGCTCTAGGGGGTATCCCCACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGT 
25 GTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCT 
CGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGCATCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACC 
TCGACCCCAAAAAACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCTTTGACG 
TTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTCGGTCTATTCTTTTGA 
TTTATAAGGGATTTTGGGGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAACAAAAATTTAACGCGAATTAATTCT 
30 GTGGAATGTGTGTCAGTTAGGGTGTGGAAAGTCCCCAGGCTCCCCAGGCAGGCAGAAGTATGCAAAGCATGCATCTCAAT 

• TAGTCAGCAACCAGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCAGAAGTATGCAAAGCATGCATCTCAATTAGTCAGC 
AACCATAGTCCCGCCCCTAACTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCATGGCTGAC 
TAATTTTTTTTATTTATGCAGAGGCCGAGGCCGCCTCTGCCTCTGAGCTATTCCAGAAGTAGTGAGGAGGCTTTTTTGGA 
GGCCTAGGCTTTTGCAAAAAGCTCCCGGGAGCTTGTATATCCATTTTCGGATCTGATCAGCACGTGTTGACAATTAATCA 
35 TCGGCATAGTATATCGGCATAGTATAATACGACAAGGTGAGGAACTAAACCATGGCCAAGTTGACCAGTGCCGTTCCGGT 
GCTCACCGCGCGCGACGTCGCCGGAGCGGTCGAGTTCTGGACCGACCGGCTCGGGTTCTCCCGGGACTTCGTGGAGGACG 
ACTTCGCCGGTGTGGTCCGGGACGACGTGACCCTGTTCATCAGCGCGGTCCAGGACCAGGTGGTGCCGGACAACACCCTG 
GCCTGGGTGTGGGTGCGCGGCCTGGACGAGCTGTACGCCGAGTGGTCGGAGGTCGTGTCCACGAACTTCCGGGACGCCTC 
CGGGCCGGCCATGACCGAGATCGGCGAGCAGCCGTGGGGGCGGGAGTTCGCCCTGCGCGACCCGGCCGGCAACTGCGTGC 
40 ACTTCGTGGCCGAGGAGCAGGACTGACACGTGCTACGAGATTTCGATTCCACCGCCGCCTTCTATGAAAGGTTGGGCTTC 
GGAATCGTTTTCCGGGACGCCGGCTGGATGATCCTCCAGCGCGGGGATCTCATGCTGGAGTTCTTCGCCCACCCCAACTT 
GTTTATTGCAGCTTATAATGGTTACAAATAAAGCAATAGCATCACAAATTTCACAAATAAAGCATTTTTTTCACTGCATT 
CTAGTTGTGGTTTGTCCAAACTCATCAATGTATCTTATCATGTCTGTATACCGTCGACCTCTAGCTAGAGCTTGGCGTAA 
TCATGGTCATAGCTGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTCCACACAACATACGAGCCGGAAGCATAAAGTG 
45 TAAAGCCTGGGGTGCCTAATGAGTGAGCTAACTCACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGGAAACC 
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TGTCGTGCCAGCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCG 
CTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCAC 
AGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGC 
TGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACA 
5 GGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATA 
CCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCAATGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCG 
TTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAG 
TCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGG 
TGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGC 
10 CAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGC 
AAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAA 
CGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAA 
GTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCA 
GCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATC 
15 TGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAA 
^|^GGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGT 
^•^■ AGTT CGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGC 
^^^TTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCG 
GTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACT 
20 GTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACC 
GAGTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAAC 
GTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGA 
TCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAG 
GGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGA 
25 G CGGATACATATTTGAATGT ATTTAGAAAAAT AAAC AAATAGGGGT T C CGCGC ACAT TTC C C CGAAAAGTGC C AC CTGAC 
GTC 

Fig. 11 

, GA CGGATCGGGAGATCTCCCGATCCCCTATGGTCGACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGTAT 
"'■f^W CTGCTCCCTGCTTGTGTGTTGGAGGTCGCTGAGTAGTGCGCGAGCAAAATTTAAGCTACAACAAGGCAAGGCTTGACCGA 
CAATTGCATGAAGAATCTGCTTAGGGTTAGGCGTTTTGCGCTGCTTCGCGATGTACGGGCCAGATATACGCGTTGACATT 
GATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAA 
CTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGT 
35 AACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGACTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGT 
ATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTA 
TGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAA 
TGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACC 
AA7^TCAACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAG 
40 GTCTATATAAGCAGAGCTCTCTGGCTAACTAGAGAACCCACTGCTTACTGGCTTATCGAAATTAATACGACTCACTATAG 
GGAGACCCAAGCTGGCTAGCAGTCGAGGAATCATGCTGGAGAAGTTCTGCAACTCTACTTTTTGGAATTCCTCATTCCTG 
GACAGTCCGGAGGCAGACCTGCCACTTTGTTTTGAGCAAACTGTTCTGGTGTGGATTCCCTTGGGCTTCCTATGGCTCCT 
GGCCCCCTGGCAGCTTCTCCACGTGTATAAATCCAGGACCAAGAGATCCTCTACCACCAAACTCTATCTTGCTAAGCAGG 
TATTCGTTGGTTTTCTTCTTATTCTAGCAGCCATAGAGCTGGCCCTTGTACTCACAGAAGACTCTGGACAAGCCACAGTC 
45 CCTGCTGTTCGATATACCAATCCAAGCCTCTACCTAGGCACATGGCTCCTGGTTTTGCTGATCCAATACAGCAGACAATG 
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GTGTGTACAGAAAAACTCCTGGTTCCTGTCCCTATTCTGGATTCTCTCGATACTCTGTGGCACTTTCCAATTTCAGACTC 
TGATCCGGACACTCTTACAGGGTGACAATTCTAATCTAGCCTACTCCTGCCTGTTCTTCATCTCCTACGGATTCCAGATC 
CTGATCCTGATCTTTTCAGCATTTTCAGAAAATAATGAGTCATCAAATAATCCATCATCCATAGCTTCATTCCTGAGTAG 
CATTACCTACAGCTGGTATGACAGCATCATTCTGAAAGGCTACAAGCGTCCTCTGACACTCGAGGATGTCTGGGAAGTTG 
5 ATGAAGAGATGAAAACCAAGACATTAGTGAGCAAGTTTGAAACGCACATGAAGAGAGAGCTGCAGAAAGCCAGGCGGGCA 
CTCCAGAGACGGCAGGAGAAGAGCTCCCAGCAGAACTCTGGAGCCAGGCTGCCTGGCTTGAACAAGAATCAGAGTCAAAG 
CCAAGATGCCCTTGTCCTGGAAGATGTTGAAAAGAAAAAAAAGAAGTCTGGGACCAAAAAAGATGTTCCAAAATCCTGGT 
TGATGAAGGCTCTGTTCAAAACTTTCTACATGGTGCTCCTGAAATCATTCCTACTGAAGCTAGTGAATGACATCTTCACG 
TTTGTGAGTCCTCAGCTGCTGAAATTGCTGATCTCCTTTGCAAGTGACCGTGACACATATTTGTGGATTGGATATCTCTG 

10 TGCAATCCTCTTATTCACTGCGGCTCTCATTCAGTCTTTCTGCCTTCAGTGTTATTTCCAACTGTGCTTCAAGCTGGGTG 
TAAAAGTACGGACAGCTATCATGGCTTCTGTATATAAGAAGGCATTGACCCTATCCAACTTGGCCAGGAAGGAGTACACC 
GTTGGAGAAACAGTGAACCTGATGTCTGTGGATGCCCAGAAGCTCATGGATGTGACCAACTTCATGCACATGCTGTGGTC 
AAGTGTTCTACAGATTGTCTTATCTATCTTCTTCCTATGGAGAGAGTTGGGACCCTCAGTCTTAGCAGGTGTTGGGGTGA 
TGGTGCTTGTAAT C C CAATTAATGCGATACTGT C C AC C AAGAGTAAGAC C ATTC AGGT CAAAAATATGAAGAATAAAGAC 

15 AAACGTTTAAAGATCATGAATGAGATTCTTAGTGGAATCAAGATCCTGAAATATTTTGCCTGGGAACCTTCATTCAGAGA 

• CCAAGTACAAAACCTCCGGAAGAAAGAGCTCAAGAACCTGCTGGCCTTTAGTCAACTACAGTGTGTAGTAATATTCGTCT 
TCCAGTTAACTCCAGTCCTGGTATCTGTGGTCACATTTTCTGTTTATGTCCTGGTGGATAGCAACAATATTTTGGATGCA 
CAAAAGGCCTTCACCTCCATTACCCTCTTCAATATCCTGCGCTTTCCCCTGAGCATGCTTCCCATGATGATCTCCTCCAT 
GCTCCAGGCCAGTGTTTCCACAGAGCGGCTAGAGAAGTACTTGGGAGGGGATGACTTGGACACATCTGCCATTCGACATG 
20 ACTGCAATTTTGACAAAGCCATGCAGTTTTCTGAGGCCTCCTTTACCTGGGAACATGATTCGGAAGCCACAGTCCGAGAT 
GTGAACCTGGACATTATGGCAGGCCAACTTGTGGCTGTGATAGGCCCTGTCGGCTCTGGGAAATCCTCCTTGATATCAGC 
CATGCTGGGAGAAATGGAAAATGTCCACGGGCACATCACCATCAAGGGCACCACTGCCTATGTCCCACAGCAGTCCTGGA 
TTCAGAATGGCACCATAAAGGACAACATCCTTTTTGGAACAGAGTTTAATGAAAAGAGGTACCAGCAAGTACTGGAGGCC 
TGTGCTCTCCTCCCAGACTTGGAAATGCTGCCTGGAGGAGATTTGGCTGAGATTGGAGAGAAGGGTATAAATCTTAGTGG 
25 GGGTCAGAAGCAGCGGATCAGCCTGGCCAGAGCTACCTACCAAAATTTAGACATCTATCTTCTAGATGACCCCCTGTCTG 
CAGTGGATGCTCATGTAGGAAAACATATTTTTAATAAGGTCTTGGGCCCCAATGGCCTGTTGAAAGGCAAGACTCGACTC 
TTGGTTACACATAGCATGCACTTTCTTCCTCAAGTGGATGAGATTGTAGTTCTGGGGAATGGAACAATTGTAGAGAAAGG 
AT C CTACAGTGC T C T C C TGGC CAAAAAAGGAGAGTTTGCTAAGAAT CTGAAGAC ATTT CTAAGAC ATAC AGGC C CTGAAG 
AGGAAG C CAC AGT CC ATGATGGCAGTGAAGAAGAAGACGATGACTATGGGCTGATAT C C AGTGTGGAAGAGAT C C C CGAA 
30 GATGCAGCCTCCATAACCATGAGAAGAGAGAACAGCTTTCGTCGAACACTTAGCCGCAGTTCTAGGTCCAATGGCAGGCA 
TCTGAAGTCCCTGAGAAACTCCTTGAAAACTCGGAATGTGAATAGCCTGAAGGAAGACGAAGAACTAGTGAAAGGACAAA 
AACTAATTAAGAAGGAATTCATAGAAACTGGAAAGGTGAAGTTCTCCATCTACCTGGAGTACCTACAAGCAATAGGATTG 
TTTTCGATATTCTTCATCATCCTTGCGTTTGTGATGAATTCTGTGGCTTTTATTGGATCCAACCTCTGGCTCAGTGCTTG 
GACCAGTGACTCTAAAATCTTCAATAGCACCGACTATCCAGCATCTCAGAGGGACATGAGAGTTGGAGTCTACGGAGCTC 
35 TGGGATTAGCCCAAGGTATATTTGTGTTCATAGCACATTTCTGGAGTGCCTTTGGTTTCGTCCATGCATCAAATATCTTG 
CACAAGCAACTGCTGAACAATATCCTTCGAGCACCTATGAGATTTTTTGACACAACACCCACAGGCCGGATTGTGAACAG 
GTTTGCCGGCGATATTTCCACAGTGGATGACACCCTGCCTCAGTCCTTGCGCAGCTGGATTACATGCTTCCTGGGGATAA 
TCAGCACCCTTGTCATGATCTGCATGGCCACTCCTGTCTTCACCATCATCGTCATTCCTCTTGGCATTATTTATGTATCT 
GTTCAGATGTTTTATGTGTCTACCTCCCGCCAGCTGAGGCGTCTGGACTCTGTCACCAGGTCCCCAATCTACTCTCACTT 
40 CAGCGAGACCGTATCAGGTTTGCCAGTTATCCGTGCCTTTGAGCACCAGCAGCGATTTCTGAAACACAATGAGGTGAGGA 
TTGACACCAACCAGAAATGTGTCTTTTCCTGGATCACCTCCAACAGGTGGCTTGCAATTCGCCTGGAGCTGGTTGGGAAC 
CTGACTGTCTTCTTTTCAGCCTTGATGATGGTTATTTATAGAGATACCCTAAGTGGGGACACTGTTGGCTTTGTTCTGTC 
CAATGCACTCAATATCACACAAACCCTGAACTGGCTGGTGAGGATGACATCAGAAATAGAGACCAACATTGTGGCTGTTG 
AGCGAATAACTGAGTAC ACAAAAGTGGAAAATGAGGCAC C CTGGGTGAC TGATAAGAGGC CT C CGC CAGATTGGCC CAGC 
45 AAAGGCAAGAT C CAGTT TAACAACTACC AAGTGCGGTAC CGAC CTGAGC TGGAT CTGGTC CT CAGAGGGAT CACT TGTGA 
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CATTGGTAGCATGGAGAAGATTGGTGTGGTGGGCAGGACAGGAGCTGGAAAGTCATCCCTCACAAACTGCCTCTTCAGAA 
TCTTAGAGGCTGCCGGTGGTCAGATTATCATTGATGGAGTAGATATTGCTTCCATTGGGCTCCACGACCTCCGAGAGAAG 
CTGACCATCATCCCCCAGGACCCCATCCTGTTCTCTGGAAGCCTGAGGATGAATCTCGACCCTTTCAACAACTACTCAGA 
TGAGGAGATTTGGAAGGCCTTGGAGCTGGCTCACCTCAAGTCTTTTGTGGCCAGCCTGCAACTTGGGTTATCCCACGAAG 
5 TGACAGAGGCTGGTGGCAACCTGAGCATAGGCCAGAGGCAGCTGCTGTGCCTGGGCAGGGCTCTGCTTCGGAAATCCAAG 
ATCCTGGTCCTGGATGAGGCCACTGCTGCGGTGGATCTAGAGACAGACAACCTCATTCAGACGACCATCCAAAACGAGTT 
CGCCCACTGCACAGTGATCACCATCGCCCACAGGCTGCACACCATCATGGACAGTGACAAGGTAATGGTCCTAGACAACG 
GGAAGATTATAGAGTGCGGCAGCCCTGAAGAACTGCTACAAATCCCTGGACCCTTTTACTTTATGGCTAAGGAAGCTGGC 
ATTGAGAATGTGAACAGCACAAAATTCTAGCTTAAGCTTGGTACCGAGCTCGGATCCACTAGTCCAGTGTGGTGGAATTC 

10 TGCAGATATCCAGCACAGTGGCGGCCGCTCGAGTCTAGAGGGCCCGTTTAAACCCGCTGATCAGCCTCGACTGTGCCTTC 
TAGTTGCCAGCCATCTGTTGTTTGCCCCTCCCCCGTGCCTTCCTTGACCCTGGAAGGTGCCACTCCCACTGTCCTTTCCT 
AATAAAATGAGGAAATTGCATCGCATTGTCTGAGTAGGTGTCATTCTATTCTGGGGGGTGGGGTGGGGCAGGACAGCAAG 
GGGGAGGATTGGGAAGACAATAGCAGGCATGCTGGGGATGCGGTGGGCTCTATGGCTTCTGAGGCGGAAAGAACCAGCTG 
GGGCTCTAGGGGGTATCCCCACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCG 

15 CTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGT 

( ^ ^ c ^ gctCt; ^ tcggggcatcccttt agggttccgatttagtgctttacg 

w* ^ TGATGGTTCACGTAGTGGGCCATCGCCC 

gact cttgttccaaactggaacaacactcaaccctatctcggtctattcttttgatttataagggattttggggatttcg 

gcctattggttaaaaaatgagctgatttaacaaaaatttaacgcgaattaattctgtggaatgtgtgtcagttagggtgt 

20 GGAAAGTCCCCAGGCTCCCCAGGCAGGCAGAAGTATGCAAAGCATGCATCTCAATTAGTCAGCAACCAGGTGTGGAAAGT 
CCCCAGGCTCCCCAGCAGGCAGAAGTATGCAAAGCATGCATCTCAATTAGTCAGCAACCATAGTCCCGCCCCTAACTCCG 
CCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCATGGCTGACTAATTTTTTTTATTTATGCAGAGGC 
CGAGGCCGCCTCTGCCTCTGAGCTATTCCAGAAGTAGTGAGGAGGCTTTTTTGGAGGCCTAGGCTTTTGCAAAAAGCTCC 
CGGGAGCTTGTATATCCATTTTCGGATCTGATCAGCACGTGTTGACAATTAATCATCGGCATAGTATATCGGCATAGTAT 
25 AATACGACAAGGTGAGGAACTAAACCATGGCCAAGTTGACCAGTGCCGTTCCGGTGCTCACCGCGCGCGACGTCGCCGGA 
GCGGTCGAGTTCTGGACCGACCGGCTCGGGTTCTCCCGGGACTTCGTGGAGGACGACTTCGCCGGTGTGGTCCGGGACGA 
CGTGACCCTGTTCATCAGCGCGGTCCAGGACCAGGTGGTGCCGGACAACACCCTGGCCTGGGTGTGGGTGCGCGGCCTGG 
ACGAGCTGTACGCCGAGTGGTCGGAGGTCGTGTCCACGAACTTCCGGGACGCCTCCGGGCCGGCCATGACCGAGATCGGC 
GAGCAGCCGTGGGGGCGGGAGTTCGCCCTGCGCGACCCGGCCGGCAACTGCGTGCACTTCGTGGCCGAGGAGCAGGACTG 
ACACGTGCTACGAGATTTCGATTCCACCGCCGCCTTCTATGAAAGGTTGGGCTTCGGAATCGTTTTCCGGGACGCCGGCT 
GGATGATCCTCCAGCGCGGGGATCTCATGCTGGAGTTCTTCGCCCACCCCAACTTGTTTATTGCAGCTTATAATGGTTAC 
' AAATAAAGCAATAGCATCACAAATTTCACAAATAAAGCATTTTTTTCACTGCATTCTAGTTGTGGTTTGTCCAAACTCAT 
C AATGTAT CTTAT CATGT CTGTATACCGT CGAC C T CTAG CTAGAG CT TGG CGTAAT CATGGT CATAGCTGT TT C CTGTGT 

GAAATTGTTATCCGCTCACAATTCGACAGAACATACGAGCCGGAAGCATAAAGTGTAAAGCCTGGGGTGCCTAATGAGTG 
35 AGCTAACTGACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCCAGCTGCATTAATGAAT 
CGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCG 
TTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAG 
AACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCC 
CCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTC 
40 ' CCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGA 
AGCGTGGCGCTTTCTCAATGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCA 
CGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTAT 
CGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGG 
CCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGG 
45 TAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAA 




t 



41 



AAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTG 
GTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATA 
TGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCA 
TAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCG 
CGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGC 
AACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCA 
ACGTTGTTGCCATTGCTACAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGA 
TCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAA 
GTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTT 
CTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATA 
CGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAG 
GATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCA 
GCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTC 
ATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTA 
GAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTC 
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<110> Aventis Pharma Deutschland GmbH 

<120> Dual -transfected cell llines as in vitro screening tools for 
pharmaceutical compound profiling: A model for hepatobiliary 
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<212> DNA 

<213> Homo sapiens 

<400> 1 



caggaggatg 


gaggcccaca 


acgcgtctgc 


cccattcaac 


ttcaccctgc 


cacccaactt 


60 


fcagcaagcgc 


cccacagacc 


tggcactgag 


cgtcatcctg 


gtgttcatgt 


tgttcttcat 


120 


SMcgctctcg 


ctgggctgca 


ccatggagtt 


cagcaagatc 


aaggctcact 


tatggaagcc 


180 


taaagggctg 


gccatcgccc 


tggtggcaca 


gtatggcatc 


atgcccctca 


cggcctttgt 


240 


gctgggcaag 


gtcttccggc 


tgaagaacat 


tgaggcactg 


gccatcttgg 


tctgtggctg 


300 


ctcacctgga 


gggaacctgt 


ccaatgtctt 


cagtctggcc 


atgaaggggg 


acatgaacct 


360 


cagcattgtg 


atgaccacct 


gctccacctt 


ctgtgccctt 


ggcatgatgc 


ctctcctcct 


420 


gtacatctac 


tccaggggga 


tctatgatgg 


ggacctgaag 


gacaaggtgc 


cctataaagg 


480 


catcgtgata 


tcactggtcc 


tggttctcat 


tccttgcacc 


atagggatcg 


tcctcaaatc 


540 


taaacggcca 


caatacatgc 


gctatgtcat 


caagggaggg 


atgatcatca 


ttctcttgtg 


600 


cagtgtggcc 


gtcacagttc 


tctctgccat 


caatgtgggg 


aagagcatca 


tgtttgccat 


660 


gacaccactc 


ttgattgcca 


cctcctccct 


gatgccttct 


attggctttc 


tgctgggtta 


720 


tgttctctct 


gctctcttct 


gcctcaatgg 


acggtgcaga 


cgcactgtca 


gcatggagac 


780 


Bgatgccaa 


aatgtccaac 


tctgttccac 


catcctcaat 


gtggcctttc 


cacctgaagt 


840 


xattggacca 


cttttcttct 


ttcccctcct 


ctacatgatt 


ttccagcttg 


gagaagggct 


900 


tctcctcatt 


gccatatttt 


ggtgctatga 


gaaattcaag 


actcccaagg 


ataaaacaaa 


960 


aatgatctac 


acagctgcca 


caactgaaga 


aacaattcca 


ggagctctgg 


gaaatggcac 


1020 


ctacaaaggg 


gaggactgct 


ccccttgcac 


agcctagccc 


t 
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<210> 2 

<211> 3966 

<212> DNA 

<213> Homo sapiens 

<400> 2 

atgtctgact cagtaattct tcgaagtata aagaaatttg gagaggagaa tgatggtttt 60 

gagtcagata aatcatataa taatgataag aaatcaaggt tacaagatga gaagaaaggt 120 

gatggcgtta gagttggctt ctttcaattg tttcggtttt cttcatcaac tgacatttgg 180 
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ctgatgtttg tgggaagttt gtgtgcattt ctccatggaa tagcccagcc aggcgtgcta 240 

ctcatttttg gcacaatgac agatgttttt attgactacg acgttgagtt acaagaactc 300 

cagattccag gaaaagcatg tgtgaataac accattgtat ggactaacag ttccctcaac 360 

cagaacatga caaatggaac acgttgtggg ttgctgaaca tcgagagcga aatgatcaaa 420 

tttgccagtt actatgctgg aattgctgtc gcagtactta tcacaggata tattcaaata 480 

tgcttttggg tcattgccgc agctcgtcag atacagaaaa tgagaaaatt ttactttagg 540 

agaataatga gaatggaaat agggtggttt gactgcaatt cagtggggga gctgaataca 600 

agattctctg atgatattaa taaaatcaat gatgccatag ctgaccaaat ggcccttttc 660 

attcagcgca tgacctcgac catctgtggt ttcctgttgg gatttttcag gggttggaaa 720 

ctgaccttgg ttattatttc tgtcagccct ctcattggga ttggagcagc caccattggt 780 

ctgagtgtgt ccaagtttac ggactatgag ctgaaggcct atgccaaagc aggggtggtg 840 

tgatgaag tcatttcatc aatgagaaca gtggctgctt ttggtggtga gaaaagagag 900 

[ttgaaaggt atgagaaaaa tcttgtgttc gcccagcgtt ggggaattag aaaaggaata 960 

gtgatgggat tctttactgg attcgtgtgg tgtctcatct ttttgtgtta tgcactggcc 1020 

ttctggtacg gctccacact tgtcctggat gaaggagaat atacaccagg aacccttgtc 1080 

cagattttcc tcagtgtcat agtaggagct ttaaatcttg gcaatgcctc tccttgtttg 1140 

gaagcctttg caactggacg tgcagcagcc accagcattt ttgagacaat agacaggaaa 1200 

cccatcattg actgcatgtc agaagatggt tacaagttgg atcgaatcaa gggtgaaatt 1260 

gaattccata atgtgacctt ccattatcct tccagaccag aggtgaagat tctaaatgac 1320 

ctcaacatgg ccattaaacc aggggaaatg acagctctgg taggacccag tggagctgga 1380 

aaaagtacag cactgcaact cattcagcga ttctatgacc cctgtgaagg aatggtgacc 1440 

gtggatggcc atgacattcg ctctcttaac attcagtggc ttagagatca gattgggata 1500 

ggagcaag agccagttct gttctctacc accattgcag aaaatattcg ctatggcaga 1560 

agatgcaa caatggaaga catagtccaa gctgccaagg aggccaatgc ctacaacttc 1620 

atcatggacc tgccacagca atttgacacc cttgttggag aaggaggagg ccagatgagt 1680 

ggtggccaga aacaaagggt agctatcgcc agagccctca tccgaaatcc caagattctg 1740 

cttttggaca tggccacctc agctctggac aatgagagtg aagccatggt gcaagaagtg 1800 

ctgagtaaga ttcagcatgg gcacacaatc atttcagttg ctcatcgctt gtctacggtc 1860 

agagctgcag ataccatcat tggttttgaa catggcactg cagtggaaag agggacccat 1920 

gaagaattac tggaaaggaa aggtgtttac ttcactctag tgactttgca aagccaggga 1980 

aatcaagctc ttaatgaaga ggacataaag gatgcaactg aagatgacat gcttgcgagg 2040 

acctttagca gagggagcta ccaggatagt ttaagggctt ccatccggca acgctccaag 2100 

tctcagcttt cttacctggt gcacgaacct ccattagctg ttgtagatca taagtctacc 2160 

tatgaagaag atagaaagga caaggacatt cctgtgcagg aagaagttga acctgcccca 2220 
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gttaggagga 
ggtgcagctg 
gggacttttt 
ctttttgtag 
gctaaatctg 
gggcaagata 
cttgctacag 
aattccttca 
agcctggtca 
atgttgacag 
aatgaagccc 
.agcacttg 
.cggattct 
agatatggag 
atctctgcag 
tatgcaaaag 
atcagtgtat 
gttgattgta 
gtgtcgatta 
actagcattc 
ggtcatgaca 
caggaaccag 
.agaaattc 
fcatgtcac 
agaggggaga 
ctactagatg 
ctagacaaag 
cagaacgcgg 
gaagaactga 
agttga 




ttctgaaatt 
tgaacgggac 
caattcctga 
caatgggctg 
gggagctcct 
ttgcctggtt 
atgcttccca 
ctaacgtcac 
tcttgtgctt 
gatttgcctc 
tcagtaacat 
agactgagct 
gctttgcctt 
gttacttaat 
ttgtactgag 
ctaaaatatc 
acaatactgc 
aatttacata 
gtccagggca 
agctgttgga 
gcaaaaaagt 
tgttgtttgc 
ccatggaaag 
tcccagagaa 
aacaacgcat 
aagccacttc 
ccagagaggg 
atatcattgc 
tggcccaaaa 
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cagtgctcca gaatggccct acatgctggt 
agtcacaccc ttgtatgcct ttttattcag 
taaagaggaa caaaggtcac agatcaatgg 
tgtatctctt ttcacccaat ttctacaggg 
aacaaaaagg ctacgtaaat ttggtttcag 
tgatgacctc agaaatagcc ctggagcatt 
agttcaaggg gctgccggct ctcagatcgg 
tgtggccatg atcattgcct tctcctttag 
cttccccttc ttggctttat caggagccac 
tcgagataag caggccctgg agatggtggg 
ccgcactgtt gctggaattg gaaaggagag 
ggagaagccc ttcaagacag ccattcagaa 
tgcccagtgc atcatgttta ttgcgaattc 
ctccaatgag gggctccatt tcagctatgt 
tgcaacagct cttggaagag ccttctctta 
agctgcacgc ttttttcaac tgctggaccg 
aggtgaaaaa tgggacaact tccaggggaa 
tccttctcga cctgactcgc aagttctgaa 
gacactggcg tttgttggga gcagtggatg 
acgtttctat gatcctgatc aagggaaggt 
aaatgtccag ttcctccgct caaacattgg 
ctgtagcata atggacaata tcaagtatgg 
agtcatagca gctgcaaaac aggctcagct 
atatgaaact aacgttgggt cccaggggtc 
tgctattgct cgggccattg tacgagatcc 
tgccttagac acagaaagtg aaaagacggt 
tcggacctgc attgtcattg cccatcgctt 
tgtcatggca cagggggtgg tgattgaaaa 
aggagcctac tacaaactag tcaccactgg 



agggtctgtg 
ccagattctt 
tgtgtgccta 
atatgccttt 
ggcaatgctg 
gacaacaaga 
gatgatagtc 
ctggaagctg 
acagaccagg 
acagattaca 
gcggttcatt 
agccaatatt 
tgcttcctac 
gttcagggtg 
caccccaagt 
acaaccccca 
gattgatttt 
tggtctctca 
tggcaaaagc 
gatgatagat 
aattgtttcc 
agacaacacc 
gcatgatttt 
tcaactctct 
taaaatcttg 
gcaggttgct 
gtccaccatc 
ggggacccat 
atcccccatc 



2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

3966 



<210> 3 

<211> 4650 

<212> DNA 

<213> Homo sapiens 



<400> 3 
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aatccaqqaa 


tcatgctgga 


gaagttctgc 


aactctactt 


tttggaattc 


ctcattcctg 


60 


qacaotccqq 

23 w w 23 2? 


aqqcaqacct 


gccactttgt: 


tttgagcaaa 


ctqttctqqt 


gtggattccc 


120 


ttgggcttcc 


tatggctcct 


qqccccctqq 


cagcmictcc 


acgtgtataa 


atccaggacc 


180 


aagagatcct 


ctaccaccaa 


actctatctt 


gctaagcagg 


tattcgttgg 


ttttcttctt 


240 


attctagcag 


ccatagagct 




ctcacagaag 


actctggaca 


agccacagtc 


300 


cctgctg"tt:c 


gata*taccaa 


tccaaacctc 


"tacctaaaca 


cataactcct 

1*4. 1— XH V- V- V_ V* 


q 01111111! a eta 


360 


atccaataca 


gcagacaatg 


g*tg"tg*tacag 


aaaaactcct 


ggttcctgtc 


cctattctgg 


420 


attctctcga 


tactctataa 

ZJ 23 23 


cacttticcaa 


tttxagactc 


"tgatccggac 


actcttacag 


480 


ggtgacaatt 


ctaa*tctagc 


ctactcctgc 


ctgtitcttca 


tctcctacgg 


attccagatc 


540 


ctgatcctga 


tcttttcagc 


attttcagaa 


aataatigagt: 


caticaaatiaa 


tccatcatcc 


600 


atagcttcat 


tcctgagtag 


cattacctac 


aactoatato 


acagcatcat 


"tctgaaaggc 


660 


fcacaagcgtc, 


ctctgacact 




"taaaaaa"t*ta 
^yyy uw y *~ 23 


atgaagagat 


gaaaaccaag 


720 


Btat:t:agt:ga 


acaaa*t*ttiaa 


aacacacatia 


aaaaaaaaac 


"tacaaaaaac 


caaacaoaca 


780 


ctccagagac 


ggcaggagaa 


gagctxccag 


cagaactctg 


gagccaggct 


gcctggcttg 


840 


aacaagaatc 


agagtcaaag 


ccaagatgcc 


cttatcctaa 


aagatgntga 


aaagaaaaaa 


900 


aaqaaqtctq 


ggaccaaaaa 


agatgttcca 


aaatcctggt 


tqatqaaqqc 

*~ ZJ 23 w, 23 23 


tctgttcaaa 


960 


actttctaca 


tggtgctcct 


gaaatcattc 


ctactgaagc 


tagtgaatga 


catcttcacg 


1020 


tttqtqaqtc 


ctxagctgct 


gaaattgctg 


atctcctttg 


caagtgaccg 


tgacacaliat 


1080 


"ttig"tggat:"tg 


gatatctctg 


"tacaatcctc 

w s^f v« w*. w, ^ w Vp#- 


"t'tat'tcacta 


ca octet cat: 


tcagtctttc 


1140 


tgccttcagu 


gttanttcca 


actgtgcttc 




"taaaagtacg 


gacagcta"tc 


1200 


atqqcttctq 


tatataagaa 


aacattaacc 


ctatccaact 


taaccaaaaa 


ggagtacacc 


1260 


gttggagaaa 


cagtgaacct 


qatqtctqtq 


gatgcccaga 


aqctcatqqa 


tgtgaccaac 


1320 


ttcatgcaca 


tqctqtqqtc 


aagtgttcta 


cagattgtct 


tatctatctt 


cttcctatgg 


1380 


iHiaqaqttqq 


gaccctcagt 


cttagcaggt 


attqqaataa 

zi L yyyy u y u 


*taotaci:t:qt: 


aatcccaatt 


1440 


ciatgcgatac 


tgtccaccaa 


gagtaagacc 


attcaggtca 


aaaatatgaa 


gaataaagac 


1500 


aaacgtftaa 


aga"tca*tgaa 


taaaattctt 

W \A VA 4— v_ Vp. 


aataaaatca 


aaatcctaaa 


atattttacc 


1560 


taaqaacctt 


catitxagaga 


ccaagtacaa 


aacctccgga 


agaaagagct 


caagaacctg 


1620 


ctggccttta 


gtcaactaca 


atat:al:aa"ta 

23 v-y «- 23 *-*-*y ^-i* 


atattcatct: 


"tccaatiiiaac 


tccaatccta 


1680 


qtatctqtqq 

ZJ w *~ ^— ^ 22 23 23 


tcacattttc 


tgtttatgtc 




gcaacaatat 


ti"ttiggat:gca 


1740 


caaaaggcct 


tcacctccat 


taccctcttc 


aatatcctgc 


gctttcccct 


gagcatgett 


1800 


cccatgatga 


tctcctccat 


gctccaggcc 


agtgtttcca 


cagagegget 


agagaagtac 


1860 


ttgggagggg 


atgacttgga 


cacatctgcc 


attcgacatg 


actgeaattt 


tgacaaagee 


1920 


atgcagtttt 


ctgaggcctc 


ctttacctgg 


gaacatgatt 


cggaagccac 


agtccgagat 


1980 


gtgaacctgg 


acattatggc 


aggccaactt 


gtggctgtga 


taggecctgt 


cggctctggg 


2040 
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aaatcctcct 


tgatatcagc 


catgctggga 


gaaatggaaa 


atgtccacgg 


gcacatcacc 


2100 


atcaagggca 


ccactgccta 


tgtcccacag 


cagtcctgga 


ttcagaatgg 


caccataaaq 


2160 


gacaacatcc 


tttttggaac 


agagtttaat 


qaaaaqaqqt 


accagcaagt 


actggaggcc 


2220 


tgtgctctcc 


tcccagactt 


ggaaatgctg 


cctqqaqqaq 


atttggctga 


gattggagag 


2280 


aagggtataa 


atcttaataa 


aaatcaaaaa 

zj zf zJ ^ z3 


cagcggatica 


qcctqqccaq 

ZJ ZJ ZJ *3 


agctacctac 


2340 


caaaatttag 


acatctatct 


tctagatgac 


cccctgtctg 


caqtqqatqc 

23 22 22 2D 


tcatgtagga 


2400 


aaacatattt 


ttaataaggt 


cttqqqcccc 


aatqqcctqt 

^3 2D J3 


tgaaaggcaa 


gactcgactc 


2460 


ttggttacac 


atagcatgca 


ctttcttcct 


caaqtqqatq 


agattgtagt 


tctggggaat 


2520 


ggaacaattg 


taqaqaaaqq 


atcctacagt 


gctctcctgg 


ccaaaaaagg 


agagtttgct 


2580 


aagaatctga 


agacatttct 


aagacataca 


ggccctgaag 


aggaagccac 


agtccatgat 


2640 


qqcaatqaaa 


aagaagacga 


taactataaa 


ctgatatcca 


gtgtggaaga 


gatccccgaa 


2700 


KLpttgcaqcct 


ccataaccat 


aaqaaqaaaq 

23 *** *** ***;3 3 9 


aacagctttc 


gtcgaacact 


taqccqcaqt 


2760 


■Btaqqtcca 


atggcaggca 


tctgaagtcc 


ctgagaaact 


ccttgaaaac 


tcggaatgtg 


2820 


aatagcctga 


aqqaaqacqa 


agaactagtg 


aaaggacaaa 


aactaattaa 


gaaggaattc 


2880 


atagaaactg 


gaaaggtgaa 


gttctccatc 


tacctggagt 


acctacaagc 


aataggattg 


2940 


ttttcgatat 


tcttcatcat 


ccttgcgttt 


qtqatqaatt 

Zl ZJ 53 


ctgtggcttt 


tattggatcc 


3000 


aacctctggc 


tcagtgcttg 


gaccagtgac 


tctaaaatct 


tcaatagcac 


cgactatcca 


3060 


gcatctcaga 


gggacatgag 


aqttqqaqtc 


tacggagctc 


tgggattagc 


ccaaggtata 


3120 


tttgtgttca 


tagcacattt 


ctqqaqtqcc 


tttggtttcg 


tccatgcatc 


aaatatcttg 


3180 


cacaagcaac 


tgctgaacaa 


tatccttcga 


gcacctatga 


gattttttga 


cacaacaccc 


3240 


acaggccgga 


ttgtgaacag 


gtttgccggc 


gatatttcca 


cagtggatga 


caccctgcct 


3300 


cagtccttgc 


gcagctggat 


tacatgcttc 


ctggggataa tcagcaccct 


tgtcatgatc 


3360 


tqcatqqcca 


ctcctgtctt 


caccatcatc 


gtcattcctc ttggcattat 


ttatgtatct 


3420 


^•tcaqatqt 


tttatgtgtc 


tacctcccgc 


cagctgaggc 


gtctggactc 


tqtcaccaqq 

23 2D 2D 


3480 


Tccccaatct 


actctcactt 


cagcgagacc 


gtatcaggtt 


tgccagttat 


ccgtgccttt 


3540 


qaqcaccaqc 


agcgatttct 


gaaacacaat 


gaggtgagga ttgacaccaa 


ccaqaaatqt 


3600 


gtcttttcct 


ggatcacctc 


caacaqqtqq 


cttgcaattc 


gcctggagct 


ggttgggaac 


3660 


ctgactgtct 


tcttttcagc 


cttgatgatg 


gttatttata 


gagataccct 


aagtggggac 


3720 


actgttggct 


ttgttctgtc 


caatgcactc 


aatatcacac 


aaaccctgaa 


ctggctggtg 


3780 


aggatgacat 


cagaaataga 


gaccaacatt 


gtggctgttg 


agcgaataac 


tgagtacaca 


304U 


aaagtggaaa 


atgaggcacc 


ctgggtgact 


gataagaggc 


ctccgccaga 


ttggcccagc 


3900 


aaaggcaaga 


tccagtttaa 


caactaccaa 


gtgcggtacc 


gacctgagct 


ggatctggtc 


3960 


ctcagaggga 


tcacttgtga 


cattggtagc 


atggagaaga ttggtgtggt 


gggcaggaca 


4020 


ggagctggaa 


agtcatccct 


cacaaactgc 


ctcttcagaa 


tcttagaggc 


tgccggtggt 


4080 
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cagattatca 


ttgatggagt 


agatattgct 


tccattgggc 


tccacgacct 


ccgagagaag 


4140 


ctgaccatca 


tcccccagga 


ccccatcctg 


ttctctggaa 


gcctgaggat 


gaatctcgac 


4200 


cctttcaaca 


actactcaga 


taaaaaaatt 


taaaaaacct 


taaaactaac 


tcacctcaag 


4260 


tcttttatcjQ 


ccagcctgca 


acttgggtta 


tcccacgaag 


taacaaaaac 


tggtggcaac 


4320 


ctgagcatag 


accaaaaaca 


actactatac 


ctgggcaggg 


ctctgcttcg 


gaaatccaag 


4380 


atcctggtcc 


tggatgaggc 


cactgctgcg 


gtggatctag 


agacagacaa 


cctcattcag 


A A AC\ 

444U 


acgaccatcc 


aaaacgagtt 


cgcccactgc 


acagtgatca 


ccatcgccca 


caggctgcac 


4500 


accatcatgg 


acagtgacaa 


ggtaatggtc 


ctagacaacg 


ggaagattat 


agagtgcggc 


4560 


agccctgaag 


aactgctaca 


aatccctgga 


cccttttact ttatggctaa 


ggaagctggc 


4620 


attgagaatg 


tgaacagcac 


aaaattctag 








4650 




at 



<210> 4 
211> 6433 
12> DNA 

13> Artificial Sequence 
<220> 

<223> Plasmid 
<400> 4 

gacggatcgg gagatctccc gatcccctat 
ccgcatagtt aagccagtat ctgctccctg 
cgagcaaaat ttaagctaca acaaggcaag 
ttagggttag gcgttttgcg ctgcttcgcg 
gattattgac tagttattaa tagtaatcaa 
tggagttccg cgttacataa cttacggtaa 
cccgcccatt gacgtcaata atgacgtatg 
attgacgtca atgggtggag tatttacggt 
catatgcc aagtacgccc cctattgacg 
tgcccagta catgacctta tgggactttc 
tcgctattac catggtgatg cggttttggc 
actcacgggg atttccaagt ctccacccca 
aaaatcaacg ggactttcca aaatgtcgta 
gtaggcgtgt acggtgggag gtctatataa 
ctgcttactg gcttatcgaa attaatacga 
gtttaaactt aagcttggta ccgagctcgg 
ctgccccatt caacttcacc ctgccaccca 
tgagcgtcat cctggtgttc atgttgttct 
agttcagcaa gatcaaggct cacttatgga 



ggtgcactct cagtacaatc tgctctgatg 60 

cttgtgtgtt ggaggtcgct gagtagtgcg 120 

gcttgaccga caattgcatg aagaatctgc 180 

atgtacgggc cagatatacg cgttgacatt 240 

ttacggggtc attagttcat agcccatata 300 

atggcccgcc tggctgaccg cccaacgacc 360 

ttcccatagt aacgccaata gggactttcc 420 

aaactgccca cttggcagta catcaagtgt 480 

tcaatgacgg taaatggccc gcctggcatt 540 

ctacttggca gtacatctac gtattagtca 600 

agtacatcaa tgggcgtgga tagcggtttg 660 

ttgacgtcaa tgggagtttg ttttggcacc 720 

acaactccgc cccattgacg caaatgggcg 780 

gcagagctct ctggctaact agagaaccca 840 

ctcactatag ggagacccaa gctggctagc 900 

atcccaggag gatggaggcc cacaacgcgt 960 

actttggcaa gcgccccaca gacctggcac 1020 

tcatcatgct ctcgctgggc tgcaccatgg 1080 

agcctaaagg gctggccatc gccctggtgg 1140 
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cacagtatgg catcatgccc ctcacggcct ttgtgctggg caaggtcttc cggctgaaga 1200 

acattgaggc actggccatc ttggtctgtg gctgctcacc tggagggaac ctgtccaatg 1260 

tcttcagtct ggccatgaag ggggacatga acctcagcat tgtgatgacc acctgctcca 1320 

ccttctgtgc ccttggcatg atgcctctcc tcctgtacat ctactccagg gggatctatg 1380 

atggggacct gaaggacaag gtgccctata aaggcatcgt gatatcactg gtcctggttc 1440 

tcattccttg caccataggg atcgtcctca aatctaaacg gccacaatac atgcgctatg 1500 

tcatcaaggg agggatgatc atcattctct tgtgcagtgt ggccgtcaca gttctctctg 1560 

ccatcaatgt ggggaagagc atcatgtttg ccatgacacc actcttgatt gccacctcct 1620 

ccctgatgcc ttctattggc tttctgctgg gttatgttct ctctgctctc ttctgcctca 1680 

atggacggtg cagacgcact gtcagcatgg agactggatg ccaaaatgtc caactctgtt 1740 

ccaccatcct caatgtggcc tttccacctg aagtcattgg accacttttc ttctttcccc 1800 

cctctacat gattttccag cttggagaag ggcttctcct cattgccata ttttggtgct 1860 

gagaaatt caagactccc aaggataaaa caaaaatgat ctacacagct gccacaactg 1920 

aagaaacaat tccaggagct ctgggaaatg gcacctacaa aggggaggac tgctcccctt 1980 

gcacagccta gcccttctag agggcccgtt taaacccgct gatcagcctc gactgtgcct 2040 

tctagttgcc agccatctgt tgtttgcccc tcccccgtgc cttccttgac cctggaaggt 2100 

gccactccca ctgtcctttc ctaataaaat gaggaaattg catcgcattg tctgagtagg 2160 

tgtcattcta ttctgggggg tggggtgggg caggacagca agggggagga ttgggaagac 2220 

aatagcaggc atgctgggga tgcggtgggc tctatggctt ctgaggcgga aagaaccagc 2280 

tggggctcta gggggtatcc ccacgcgccc tgtagcggcg cattaagcgc ggcgggtgtg 2340 

gtggttacgc gcagcgtgac cgctacactt gccagcgccc tagcgcccgc tcctttcgct 2400 

ttcttccctt cctttctcgc cacgttcgcc ggctttcccc gtcaagctct aaatcggggg 2460 

ctccctttag ggttccgatt tagtgcttta cggcacctcg accccaaaaa acttgattag 2520 

itgatggtt cacgtagtgg gccatcgccc tgatagacgg tttttcgccc tttgacgttg 2580 

agtccacgt tctttaatag tggactcttg ttccaaactg gaacaacact caaccctatc 2640 

tcggtctatt cttttgattt ataagggatt ttgccgattt cggcctattg gttaaaaaat 2700 

gagctgattt aacaaaaatt taacgcgaat taattctgtg gaatgtgtgt cagttagggt 2760 

gtggaaagtc cccaggctcc ccagcaggca gaagtatgca aagcatgcat ctcaattagt 2820 

cagcaaccag gtgtggaaag tccccaggct ccccagcagg cagaagtatg caaagcatgc 2880 

atctcaatta gtcagcaacc atagtcccgc ccctaactcc gcccatcccg cccctaactc 2940 

cgcccagttc cgcccattct ccgccccatg gctgactaat tttttttatt tatgcagagg 3000 

ccgaggccgc ctctgcctct gagctattcc agaagtagtg aggaggcttt tttggaggcc 3060 

taggcttttg caaaaagctc ccgggagctt gtatatccat tttcggatct gatcaagaga 3120 

caggatgagg atcgtttcgc atgattgaac aagatggatt gcacgcaggt tctccggccg 3180 
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cttgggtgga 
ccgccgtgtt 
ccggtgccct 
gcgttccttg 
tgggcgaagt 
ccatcatggc 
accaccaagc 
atcaggatga 
tcaaggcgcg 
cgaatatcat 
tggcggaccg 
cgaatgggc 
gccttcta 
cgaecaagcg 
aaggttgggc 
tctcatgctg 
ataaagcaat 
tggtttgtcc 
gagcttggcg 
tccacacaac 
ctaactcaca 
ccagctgcat 
ttccgcttcc 
jCtcactca 
atgtgagca 
tttccatagg 
gcgaaacccg 
ctctcctgtt 
cgtggcgctt 
caagctgggc 
ctatcgtctt 
taacaggatt 
taactacggc 
cttcggaaaa 
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gaggctattc ggctatgact gggcacaaca gacaatcggc tgctctgatg 3240 

ccggctgtca gcgcaggggc gcccggttct ttttgtcaag accgacctgt 3300 

gaatgaactg caggacgagg cagcgcggct atcgtggctg gccacgacgg 3360 

cgcagctgtg ctcgacgttg tcactgaagc gggaagggac tggctgctat 3420 

gccggggcag gatctcctgt catctcacct tgctcctgcc gagaaagtat 3480 

tgatgcaatg cggcggctgc atacgcttga tccggctacc tgcccattcg 3540 

gaaacatcgc atcgagcgag cacgtactcg gatggaagcc ggtcttgtcg 3600 

tctggacgaa gagcatcagg ggctcgcgcc agccgaactg ttcgccaggc 3660 

catgcccgac ggcgaggatc tcgtcgtgac ccatggcgat gcctgcttgc 3720 

ggtggaaaat ggccgctttt ctggattcat cgactgtggc cggctgggtg 3780 

ctatcaggac atagcgttgg ctacccgtga tattgctgaa gagcttggcg 3840 

tgaccgcttc ctcgtgcttt acggtatcgc cgctcccgat tcgcagcgca 3900 

tcgccttctt gacgagttct tctgagcggg actctggggt tcgaaatgac 3960 

acgcccaacc tgccatcacg agatttcgat tccaccgccg ccttctatga 4020 

ttcggaatcg ttttccggga cgccggctgg atgatcctcc agcgcgggga 4080 

gagttcttcg cccaccccaa cttgtttatt gcagcttata atggttacaa 4140 

agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg 4200 

aaactcatca atgtatctta tcatgtctgt ataccgtcga cctctagcta 4260 

taatcatggt catagctgtt tcctgtgtga aattgttatc cgctcacaat 4320 

atacgagccg gaagcataaa gtgtaaagcc tggggtgcct aatgagtgag 4380 

ttaattgcgt tgcgctcact gcccgctttc cagtcgggaa acctgtcgtg 4440 

taatgaatcg gccaacgcgc ggggagaggc ggtttgcgta ttgggcgctc 4500 

tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc 4560 

aaggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa 4620 

aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt 4680 

ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg 4740 

acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg 4800 

ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag 4860 

tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc 4920 

tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa 4980 

gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg 5040 

agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc 5100 

tacactagaa gaacagtatt tggtatctgc gctctgctga agccagttac 5160 

agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggttt 5220 
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ttttgtttgc 


aagcagcaga 


ttacgcgcag 


aaaaaaagga 


tctcaagaag 


atcctttgat 


5280 


cttttctacg 


gggtctgacg 


ctcagtggaa 


cgaaaactca 


cgttaaggga 


ttttggtcat 


5340 


gagattatca 


aaaaggatct 


tcacctagat 


ccttttaaat 


taaaaatgaa 


gttttaaatc 


5400 


aatctaaagt 


atatatgagt 


aaacttggtc 


tgacagttac 


caatgcttaa 


tcagtgaggc 


5460 


acctatctca 


gcgatctgtc 


tatttcgttc 


atccatagtt 


gcctgactcc 


ccgtcgtgta 


5520 


gataactacg 


atacgggagg 


gcttaccatc 


tggccccagt 


gctgcaatga 


taccgcgaga 


5580 


cccacgctca 


ccggctccag 


atttatcagc 


aataaaccag 


ccagccggaa 


gggccgagcg 


5640 


cagaagtggt 


cctgcaactt 


tatccgcctc 


catccagtct 


attaattgtt 


gccgggaagc 


5700 


tagagtaagt 


agttcgccag 


ttaatagttt 


gcgcaacgtt 


gttgccattg 


ctacaggcat 


5760 


cgtggtgtca 


cgctcgtcgt 


ttggtatggc 


ttcattcagc 


tccggttccc 


aacgatcaag 


5820 


gcgagttaca 


tgatccccca 


tgttgtgcaa 


aaaagcggtt 


agctccttcg 


gtcctccgat 


5880 


£qttgtcaga 


agtaagttgg 


ccgcagtgtt 


atcactcatg 


gttatggcag 


cactgcataa 


5940 


Mctcttact 


gtcatgccat 


ccgtaagatg 


cttttctgtg 


actggtgagt 


actcaaccaa 


6000 


gtcattctga 


gaatagtgta 


tgcggcgacc 


gagttgctct 


tgcccggcgt 


caatacggga 


6060 


taataccgcg 


ccacatagca 


gaactttaaa 


agtgctcatc 


attggaaaac 


gttcttcggg 


6120 


gcgaaaactc 


tcaaggatct 


taccgctgtt 


gagatccagt 


tcgatgtaac 


ccactcgtgc 


6180 


■3 /~ *~ ~3 rjr*"+*nta 
dCCCclclL. Lyd. 




CIuItaCLlI 


^ r~ f~ ^ f"1 "t" *t~ 

caccay cgix 


xcxggg xgag 


CddaddCagg 




aaggcaaaat 


gccgcaaaaa 


agggaataag 


ggcgacacgg 


aaatgttgaa 


tactcatact 


6300 


cttccttttt 


caatattatt 


gaagcattta 


tcagggttat 


tgtctcatga 


gcggatacat 


6360 


atttgaatgt 


atttagaaaa 


ataaacaaat 


aggggttccg 


cgcacatttc 


cccgaaaagt 


6420 


gccacctgac 


gtc 










6433 




5 

9043 
DNA 

Artificial Sequence 



PTasmid 
<400> 5 



gacggatcgg 


gagatctccc 


gatcccctat 


ggtcgactct 


cagtacaatc 


tgctctgatg 


60 


ccgcatagtt 


aagccagtat 


ctgctccctg 


cttgtgtgtt 


ggaggtcgct 


gagtagtgcg 


120 


cgagcaaaat 


ttaagctaca 


acaaggcaag 


gcttgaccga 


caattgcatg 


aagaatctgc 


180 


ttagggttag 


gcgttttgcg 


ctgcttcgcg 


atgtacgggc 


cagatatacg 


cgttgacatt 


240 


gattattgac 


tagttattaa 


tagtaatcaa 


ttacggggtc 


attagttcat 


agcccatata 


300 


tggagttccg 


cgttacataa 


cttacggtaa 


atggcccgcc 


tggctgaccg 


cccaacgacc 


360 


cccgcccatt 


gacgtcaata 


atgacgtatg 


ttcccatagt 


aacgccaata 


gggactttcc 


420 


attgacgtca atgggtggac 


tatttacggt 


aaactgccca 


cttggcagta 


catcaagtgt 


480 
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atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 540 

atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600 

tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660 

actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720 

aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780 

gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840 

ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gctggctagc 900 

gtttaaactt aagctatcac aagtttgtac aaaaaagcag gcttaggaat gtctgactca 960 

gtaattcttc gaagtataaa gaaatttgga gaggagaatg atggttttga gtcagataaa 1020 

tcatataata atgataagaa atcaaggtta caagatgaga agaaaggtga tggcgttaga 1080 

gttggcttct ttcaattgtt tcggttttct tcatcaactg acatttggct gatgtttgtg 1140 

gaagtttgt gtgcatttct ccatggaata gcccagccag gcgtgctact catttttggc 1200 

aatgacag atgtttttat tgactacgac gttgagttac aagaactcca gattccagga 1260 

aaagcatgtg tgaataacac cattgtatgg actaacagtt ccctcaacca gaacatgaca 1320 

aatggaacac gttgtgggtt gctgaacatc gagagcgaaa tgatcaaatt tgccagttac 1380 

tatgctggaa ttgctgtcgc agtacttatc acaggatata ttcaaatatg cttttgggtc 1440 

attgccgcag ctcgtcagat acagaaaatg agaaaatttt actttaggag aataatgaga 1500 

atggaaatag ggtggtttga ctgcaattca gtgggggagc tgaatacaag attctctgat 1560 

gatattaata aaatcaatga tgccatagct gaccaaatgg cccttttcat tcagcgcatg 1620 

acctcgacca tctgtggttt cctgttggga tttttcaggg gttggaaact gaccttggtt 1680 

attatttctg tcagccctct cattgggatt ggagcagcca ccattggtct gagtgtgtcc 1740 

aagtttacgg actatgagct gaaggcctat gccaaagcag gggtggtggc tgatgaagtc 1800 

atttcatcaa tgagaacagt ggctgctttt ggtggtgaga aaagagaggt tgaaaggtat 1860 

gaaaaatc ttgtgttcgc ccagcgttgg ggaattagaa aaggaatagt gatgggattc 1920 

ttactggat tcgtgtggtg tctcatcttt ttgtgttatg cactggcctt ctggtacggc 1980 

tccacacttg tcctggatga aggagaatat acaccaggaa cccttgtcca gattttcctc 2040 

agtgtcatag taggagcttt aaatcttggc aatgcctctc cttgtttgga agcctttgca 2100 

actggacgtg cagcagccac cagcattttt gagacaatag acaggaaacc catcattgac 2160 

tgcatgtcag aagatggtta caagttggat cgaatcaagg gtgaaattga attccataat. 2220 

gtgaccttcc attatccttc cagaccagag gtgaagattc taaatgacct caacatggcc 2280 

attaaaccag gggaaatgac agctctggta ggacccagtg gagctggaaa aagtacagca 2340 

ctgcaactca ttcagcgatt ctatgacccc tgtgaaggaa tggtgaccgt ggatggccat 2400 

gacattcgct ctcttaacat tcagtggctt agagatcaga ttgggatagt ggagcaagag 2460 

ccagttctgt tctctaccac cattgcagaa aatattcgct atggcagaga agatgcaaca 2520 
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atggaagaca 


tagtccaagc 


tgccaaggag 


gccaatgcct 


acaacttcat 


catggacctg 


2580 


ccacagcaat 


ttgacaccct 


tgttggagaa 


ggaggaggcc 


agatgagtgg 


tggccagaaa 


2640 


caaagggtag 


ctatcgccag 


agccctcatc 


cgaaatccca 


agattctgct 


tttggacatg 


2700 


gccacctcag 


ctctggacaa 


tgagagtgaa 


gccatggtgc 


aagaagtgct 


gagtaagatt 


2760 


cagcatgggc 


acacaatcat 


ttcagttgct 


catcgcttgt 


ctacggtcag 


agctgcagat 


2820 


accatcattg 


gttttgaaca 


tggcactgca 


gtggaaagag 


ggacccatga 


agaattactg 


2880 


gaaaggaaag 


gtgtttactt 


cactctagtg 


actttgcaaa 


gccagggaaa 


tcaagctctt 


2940 


aatgaagagg 


acataaagga 


tgcaactgaa 


gatgacatgc 


ttgcgaggac 


ctttagcaga 


3000 


gggagctacc 


aggatagttt 


aagggcttcc 


atccggcaac 


gctccaagtc 


tcagctttct 


3060 


tacctggtgc 


acgaacctcc 


attagctgtt 


gtagatcata 


agtctaccta 


tgaagaagat 


3120 


agaaaggaca 


aggacattcc 


tgtgcaggaa 


gaagttgaac 


ctgccccagt 


taggaggatt 


3180 


^tgaaattca 


gtgctccaga 


atggccctac 


atgctggtag 


ggtctgtggg 


tgcagctgtg 


3240 


Bcgggacag 


tcacaccctt 


gtatgccttt 


ttattcagcc 


agattcttgg 


gactttttca 


3300 


attcctgata 


aagaggaaca 


aaggtcacag 


atcaatggtg 


tgtgcctact 


ttttgtagca 


3360 


atgggctgtg 


tatctctttt 


cacccaattt 


ctacagggat 


atgcctttgc 


taaatctggg 


3420 


gagctcctaa 


caaaaaggct 


acgtaaattt 


ggtttcaggg 


caatgctggg 


gcaagatatt 


3480 


gcctggtttg 


atgacctcag 


aaatagccct 


ggagcattga 


caacaagact 


tgctacagat 


3540 


gcttcccaag 


ttcaaggggc 


tgccggctct 


cagatcggga 


tgatagtcaa 


ttccttcact 


3600 


aacgtcactg 


tggccatgat 


cattgccttc 


tcctttagct 


ggaagctgag 


cctggtcatc 


3660 


ttgtgcttct 


tccccttctt 


ggctttatca 


ggagccacac 


agaccaggat 


gttgacagga 


3720 


tttgcctctc 


gagataagca 


ggccctggag 


atggtgggac 


agattacaaa 


tgaagccctc 


3780 


agtaacatcc 


gcactgttgc 


tggaattgga 


aaggagaggc 


ggttcattga 


agcacttgag 


3840 


actgagctgg 


agaagccctt 


caagacagcc 


attcagaaag 


ccaatattta 


cggattctgc 


3900 


«tgcctttg 


cccagtgcat 


catgtttatt 


gcgaattctg 


cttcctacag 


atatggaggt 


3960 


^cttaatct 


ccaatgaggg 


gctccatttc 


agctatgtgt 


tcagggtgat 


ctctgcagtt 


4020 


gtactgagtg 


caacagctct 


tggaagagcc 


ttctcttaca 


ccccaagtta 


tgcaaaagct 


4080 


aaaatatcag 


ctgcacgctt 


ttttcaactg 


ctggaccgac 


aacccccaat 


cagtgtatac 


4140 


aatactgcag 


gtgaaaaatg 


ggacaacttc 


caggggaaga 


ttgattttgt 


tgattgtaaa 


4200 


tttacatatc 


cttctcgacc 


tgactcgcaa 


gttctgaatg 


gtctctcagt 


gtcgattagt 


4260 


ccaqqqcaqa 


cactggcgtt: 


"tattaaaaac 

•3 23 ill S3 23 


aataaatata 

23 23 23 23 23 


gcaaaagcac 


tagcattcag 


4320 


ctgttggaac 


gtttctatga 


tcctgatcaa 


gggaaggtga 


tgatagatgg 


tcatgacagc 


4380 


aaaaaagtaa 


atgtccagtt 


cctccgctca 


aacattggaa 


ttgtttccca 


ggaaccagtg 


4440 


ttgtttgcct 


gtagcataat 


ggacaatatc 


aagtatggag 


acaacaccaa 


agaaattccc 


4500 


atggaaagag 


tcatagcagc 


tgcaaaacag 


gctcagctgc 


atgattttgt 


catgtcactc 


4560 
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ccagagaaat 


atgaaactaa 


cgttgggtcc 


caggggtctc 


aactctctag 


aggggagaaa 


4620 


caacgcattg 


ctattgctcg 


ggccattgta 


cgagatccta 


aaatcttgct 


actagatgaa 


4680 


gccacttctg 


ccttagacac 


agaaagtgaa 


aagacggtgc aggttgctct 


agacaaagcc 


4740 


agagagggtc 


ggacctgcat 


tgtcattgcc 


catcgcttgt 


ccaccatcca 


gaacgcggat 


4800 


atcattgctg 


tcatggcaca 


gggggtggtg 


attgaaaagg 


ggacccatga 


agaactgatg 


4860 


gcccaaaaag 


gagcctacta 


caaactagtc 


accactggat 


cccccatcag 


ttgagaccca 


4920 


gctttcttgt 


acaaagtggt 


gattggtacc 


gagctcggat 


ccactagtcc 


agtgtggtgg 


4980 


aattctgcag 


atatccagca 


cagtggcggc 


cgctcgagtc 


tagagggccc 


gtttaaaccc 


5040 


gctgatcagc 


ctcgactgtg 


ccttctagtt 


gccagccatc 


tgttgtttgc 


ccctcccccg 


5100 


tgccttcctt 


gaccctggaa 


ggtgccactc 


ccactgtcct 


ttcctaataa 


aatgaggaaa 


5160 


ttgcatcgca 


ttgtctgagt 


aggtgtcatt 


ctattctggg 


gggtggggtg 


gggcaggaca 


5220 


^caaggggga 


ggattgggaa 


gacaatagca 


ggcatgctgg 


ggatgcggtg 


ggctctatgg 


5280 


Wtctgaggc 


ggaaagaacc 


agctggggct 


ctagggggta 


tccccacgcg 


ccctgtagcg 


5340 


gcgcattaag 


cgcggcgggt 


gtggtggtta 


cgcgcagcgt 


gaccgctaca 


cttgccagcg 


5400 


ccctagcgcc 


cgctcctttc 


gctttcttcc 


cttcctttct 


cgccacgttc 


gccggctttc 


5460 


cccgtcaagc 


tctaaatcgg 


ggcatccctt 


tagggttccg 


atttagtgct 


ttacggcacc 


5520 


tcgaccccaa 


aaaacttgat 


tagggtgatg 


gttcacgtag 


tgggccatcg 


ccctgataga 


5580 


cggtttttcg 


ccctttgacg 


ttggagtcca 


cgttctttaa 


tagtggactc 


ttgttccaaa 


5640 


ctggaacaac 


actcaaccct 


atctcggtct 


attcttttga 


tttataaggg 


attttgggga 


5700 


tttcggccta 


ttggttaaaa 


aatgagctga 


tttaacaaaa 


atttaacgcg 


aattaattct 


5760 


gtggaatgtg 


tgtcagttag 


ggtgtggaaa 


gtccccaggc 


tccccaggca 


ggcagaagta 


5820 


tgcaaagcat 


gcatctcaat 


tagtcagcaa 


ccaggtgtgg 


aaagtcccca 


ggctccccag 


5880 


caggcagaag 


tatgcaaagc 


atgcatctca 


attagtcagc 


aaccatagtc 


ccgcccctaa 


5940 


*mcegcccat 


cccgccccta 


actccgccca 


gttccgccca ttctccgccc 


catggctgac 


6000 


Waattttttt 


tatttatgca 


gaggccgagg 


ccgcctctgc 


ctctgagcta 


ttccagaagt 


6060 


agtgaggagg 


cttttttgga 


ggcctaggct 


tttgcaaaaa 


gctcccggga 


gcttgtatat 


6120 


ccattttcgg 


atctgatcag 


cacgtgttga 


caattaatca 


tcggcatagt 


atatcggcat 


6180 


agtataatac 


gacaaggtga 


ggaactaaac 


catggccaag 


ttgaccagtg 


ccgttccggt 


6240 


gctcaccgcg 


cgcgacgtcg 


ccggagcggt 


cgagttctgg 


accgaccggc 


tcgggttctc 


6300 


ccgggacttc 


ataaaaaaca 


acttcaccaa 


tgtggtccgg 


gacgacgtga 


ccc"ta"t"tcal: 


6360 


cagcgcggtc 


caggaccagg 


tggtgccgga 


caacaccctg 


gcctgggtgt 


gggtgcgcgg 


6420 


cctggacgag 


ctgtacgccg 


agtggtcgga 


ggtcgtgtcc 


acgaacttcc 


gggacgcctc 


6480 


cgggccggcc 


atgaccgaga 


tcggcgagca 


gccgtggggg 


cgggagttcg 


ccctgcgcga 


6540 


cccggccggc 


aactgcgtgc 


acttcgtggc 


cgaggagcag 


gactgacacg 


tgctacgaga 


6600 
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tttcgattcc accgccgcct tctatgaaag gttgggcttc ggaatcgttt tccgggacgc 6660 

cggctggatg atcctccagc geggggatet catgetggag ttcttcgccc accccaactt 6720 

gtttattgea gcttataatg gttacaaata aagcaatagc atcacaaatt tcacaaataa 6780 

agcatttttt teactgeatt ctagttgtgg tttgtccaaa ctcatcaatg tatcttatca 6840 

tgtctgtata ccgtcgacct ctagctagag cttggcgtaa tcatggtcat agctgtttcc 6900 

tgtgtgaaat tgttatccgc tcacaattcc acacaacata egagceggaa gcataaagtg 6960 

taaagcctgg ggtgcctaat gagtgagcta actcacatta attgcgttgc gctcactgcc 7020 

cgctttccag tegggaaace tgtcgtgcca getgeattaa tgaateggee aacgegeggg 7080 

gagaggeggt ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc 7140 

ggtcgttcgg ctgeggegag eggtatcage tcactcaaag geggtaatae ggttatccac 7200 

agaatcaggg gataaegcag gaaagaacat gtgagcaaaa ggccagcaaa aggecaggaa 7260 

gtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca 7320 

aaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc 7380 

gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaceggata 7440 

cctgtccgcc tttctccctt egggaagegt ggegctttet caatgctcac gctgtaggta 7500 

tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgeacgaac cccccgttca 7560 

gcccgaccgc tgegecttat ceggtaacta tegtcttgag tccaacccgg taagacacga 7620 

cttatcgcca ctggcagcag ccactggtaa caggattagc agagegaggt atgtaggcgg 7680 

tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg 7740 

tatctgeget ctgetgaage cagttacctt eggaaaaaga gttggtagct ettgatcegg 7800 

caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag 7860 

aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa 7920 

cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat 7980 

jttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc 8040 

gacagttac caatgettaa tcagtgaggc acctatctca gcgatctgtc tatttegtte 8100 

atccatagtt gcctgactcc ccgtcgtgta gataactacg ataegggagg gcttaccatc 8160 

tggccccagt getgeaatga taccgegaga cccacgctca ccggctccag atttatcagc 8220 

aataaaccag ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc 8280 

catccagtct attaattgtt geegggaage tagagtaagt agttcgccag ttaatagttt 8340 

gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc 8400 

ttcattcagc tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa 8460 

aaaagcggtt agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt 8520 

atcactcatg gttatggcag cactgeataa ttctcttact gtcatgecat ccgtaagatg 8580 

cttttctgtg actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc 8640 
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gagttgctct 


tgcccggcgt 


caatacggga 


taataccgcg 


ccacatagca gaactttaaa 


8700 


agtgctcatc 


attggaaaac 


gttcttcggg 


gcgaaaactc 


traaooptrl" 




8760 


gagatccagt 


tcgatgtaac 


ccactcgtgc 


acccaactga 


tcttcagcat 


cttttacttt 




caccagcgtt 


tctgggtgag 


caaaaacagg 


aaggcaaaat 


gccgcaaaaa 


agggaataag 


8880 


ggcgacacgg 


aaatgttgaa 


tactcatact 


cttccttttt 


caatattatt 


gaagcattta 


8940 


tcagggttat 


tgtctcatga 


gcggatacat 


atttgaatgt 


atttagaaaa 


ataaacaaat 


9000 


aggggttccg 


cgcacatttc 


cccgaaaagt 


gccacctgac 


gtc 




9043 




_ t c 
^^^^ 



<210> 6 
<211> 9658 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Plasmid 
00> 6 

cggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60 

ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120 

cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180 

ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240 

gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300 

tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360 

cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420 

. attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 480 

atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 540 

atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600 

tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660 

tcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720 

aaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780 

gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840 

ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gctggctagc 900 

agtccaggaa tcatgctgga gaagttctgc aactctactt tttggaattc ctcattcctg 960 

gacagtccgg aggcagacct gccactttgt tttgagcaaa ctgttctggt gtggattccc 1020 

ttgggcttcc tatggctcct ggccccctgg cagcttctcc acgtgtataa atccaggacc 1080 

aagagatcct ctaccaccaa actctatctt gctaagcagg tattcgttgg ttttcttctt 1140 

attctagcag ccatagagct ggcccttgta ctcacagaag actctggaca agccacagtc 1200 

cctgctgttc gatataccaa tccaagcctc tacctaggca catggctcct ggttttgctg 1260 

atccaataca gcagacaatg gtgtgtacag aaaaactcct ggttcctgtc cctattctgg 1320 
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attctctcga tactctgtgg cactttccaa tttcagactc tgatccggac actcttacag 1380 

ggtgacaatt ctaatctagc ctactcctgc ctgttcttca tctcctacgg attccagatc 1440 

ctgatcctga tcttttcagc attttcagaa aataatgagt catcaaataa tccatcatcc 1500 

atagcttcat tcctgagtag cattacctac agctggtatg acagcatcat tctgaaaggc 1560 

tacaagcgtc ctctgacact cgaggatgtc tgggaagttg atgaagagat gaaaaccaag 1620 

acattagtga gcaagtttga aacgcacatg aagagagagc tgcagaaagc caggcgggca 1680 

ctccagagac ggcaggagaa gagctcccag cagaactctg gagccaggct gcctggcttg 1740 

aacaagaatc agagtcaaag ccaagatgcc cttgtcctgg aagatgttga aaagaaaaaa 1800 

. aagaagtctg ggaccaaaaa agatgttcca aaatcctggt tgatgaaggc tctgttcaaa 1860 

actttctaca tggtgctcct gaaatcattc ctactgaagc tagtgaatga catcttcacg 1920 

tttgtgagtc ctcagctgct gaaattgctg atctcctttg caagtgaccg tgacacatat 1980 

•tgtggattg. gatatctctg tgcaatcctc ttattcactg cggctctcat tcagtctttc 2040 

lecttcagt gttatttcca actgtgcttc aagctgggtg taaaagtacg gacagctatc 2100 

atggcttctg tatataagaa ggcattgacc ctatccaact tggccaggaa ggagtacacc 2160 

gttggagaaa cagtgaacct gatgtctgtg gatgcccaga agctcatgga tgtgaccaac 2220 

ttcatgcaca tgctgtggtc aagtgttcta cagattgtct tatctatctt cttcctatgg 2280 

agagagttgg gaccctcagt cttagcaggt gttggggtga tggtgcttgt aatcccaatt 2340 

aatgcgatac tgtccaccaa gagtaagacc attcaggtca aaaatatgaa gaataaagac 2400 

aaacgtttaa agatcatgaa tgagattctt agtggaatca agatcctgaa atattttgcc 2460 

tgggaacctt cattcagaga ccaagtacaa aacctccgga agaaagagct caagaacctg 2520 

ctggccttta gtcaactaca gtgtgtagta atattcgtct tccagttaac tccagtcctg 2580 

gtatctgtgg tcacattttc tgtttatgtc ctggtggata gcaacaatat tttggatgca 2640 

caaaaggcct tcacctccat taccctcttc aatatcctgc gctttcccct gagcatgctt 2700 

iitf^Mcatgatga tctcctccat gctccaggcc agtgtttcca cagagcggct agagaagtac 2760 

^^^tggQagggg atgacttgga cacatctgcc attcgacatg actgcaattt tgacaaagcc 2820 

atgcagtttt ctgaggcctc ctttacctgg gaacatgatt cggaagccac agtccgagat 2880 

gtgaacctgg acattatggc aggccaactt gtggctgtga taggccctgt cggctctggg 2940 

aaatcctcct tgatatcagc catgctggga gaaatggaaa atgtccacgg gcacatcacc 3000 

atcaagggca ccactgccta tgtcccacag cagtcctgga ttcagaatgg caccataaag 3060 

gacaacatcc tttttggaac agagtttaat gaaaagaggt accagcaagt actggaggcc 3120 

tgtgctctcc tcccagactt ggaaatgctg cctggaggag atttggctga gattggagag 3180 

aagggtataa atcttagtgg gggtcagaag cagcggatca gcctggccag agctacctac 3240 

caaaatttag acatctatct tctagatgac cccctgtctg cagtggatgc tcatgtagga 3300 

aaacatattt ttaataaggt cttgggcccc aatggcctgt tgaaaggcaa gactcgactc 3360 
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ttggttacac atagcatgca ctttcttcct caagtggatg agattgtagt tctggggaat 3420 

ggaacaattg tagagaaagg atcctacagt gctctcctgg ccaaaaaagg agagtttgct 3480 

aagaatctga agacatttct aagacataca ggccctgaag aggaagccac agtccatgat 3540 

ggcagtgaag aagaagacga tgactatggg ctgatatcca gtgtggaaga gatccccgaa 3600 

gatgcagcct ccataaccat gagaagagag aacagctttc gtcgaacact tagccgcagt 3660 

tctaggtcca atggcaggca tctgaagtcc ctgagaaact ccttgaaaac tcggaatgtg 3720 

aatagcctga aggaagacga agaactagtg aaaggacaaa aactaattaa gaaggaattc 3780 

atagaaactg gaaaggtgaa gttctccatc tacctggagt acctacaagc aataggattg 3840 

ttttcgatat tcttcatcat ccttgcgttt gtgatgaatt ctgtggcttt tattggatcc 3900 

aacctctggc tcagtgcttg gaccagtgac tctaaaatct tcaatagcac cgactatcca 3960 

gcatctcaga gggacatgag agttggagtc tacggagctc tgggattagc ccaaggtata 4020 

ttgtgttca tagcacattt ctggagtgcc tttggtttcg tccatgcatc aaatatcttg 4080 

caagcaac tgctgaacaa tatccttcga gcacctatga gattttttga cacaacaccc 4140 

acaggccgga ttgtgaacag gtttgccggc gatatttcca cagtggatga caccctgcct 4200 

cagtccttgc gcagctggat tacatgcttc ctggggataa tcagcaccct tgtcatgatc 4260 

tgcatggcca ctcctgtctt caccatcatc gtcattcctc ttggcattat ttatgtatct 4320 

gttcagatgt tttatgtgtc tacctcccgc cagctgaggc gtctggactc tgtcaccagg 4380 

tccccaatct actctcactt cagcgagacc gtatcaggtt tgccagttat ccgtgccttt 4440 

gagcaccagc agcgatttct gaaacacaat gaggtgagga ttgacaccaa ccagaaatgt 4500 

gtcttttcct ggatcacctc caacaggtgg cttgcaattc gcctggagct ggttgggaac 4560 

ctgactgtct tcttttcagc cttgatgatg gttatttata gagataccct aagtggggac 4620 

actgttggct ttgttctgtc caatgcactc aatatcacac aaaccctgaa ctggctggtg 4680 

aggatgacat cagaaataga gaccaacatt gtggctgttg agcgaataac tgagtacaca 4740 

agtggaaa atgaggcacc ctgggtgact gataagaggc ctccgccaga ttggcccagc 4800 

r aaggcaaga tccagtttaa caactaccaa gtgcggtacc gacctgagct ggatctggtc 4860 

ctcagaggga tcacttgtga cattggtagc atggagaaga ttggtgtggt gggcaggaca 4920 

ggagctggaa agtcatccct cacaaactgc ctcttcagaa tcttagaggc tgccggtggt 4980 

cagattatca ttgatggagt agatattgct tccattgggc tccacgacct ccgagagaag 5040 

ctgaccatca tcccccagga ccccatcctg ttctctggaa gcctgaggat gaatctcgac 5100 

cctttcaaca actactcaga tgaggagatt tggaaggcct tggagctggc tcacctcaag 5160 

tcttttgtgg ccagcctgca acttgggtta tcccacgaag tgacagaggc tggtggcaac 5220 

ctgagcatag gccagaggca gctgctgtgc ctgggcaggg ctctgcttcg gaaatccaag 5280 

atcctggtcc tggatgaggc cactgctgcg gtggatctag agacagacaa cctcattcag 5340 

acgaccatcc aaaacgagtt cgcccactgc acagtgatca ccatcgccca caggctgcac 5400 
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accatcatgg 


acagtgacaa 


ggtaatggtc 


ctagacaacg 


ggaagattat 


agagtgegge 


5460 


agccctgaag 


aactgctaca 


aatccctqqa 

ZJ ZJ 


cccttttact 


ttatggctaa 


ggaagctggc 


5520 


attgagaatg 


tgaacagcac 


aaaattctag 


cttaagcttg 


gtaccgagct 


cggatccact 


5580 


agtccagtgt 


ggtggaattc 


tgcagatatc 


cagcacagtg 


gcggccgctc 


gagtctagag 


5640 


ggcccgttta 


aacccgctga 


tcagcctcga 


ctgtgccttc 


tagttgccag 


ccatctqttg 


5700 


tttgcccctc 


ccccgtgcct 


tccttgaccc 


tqqaaqqtqc 


cactcccact 


gtcctttcct 


5760 


aataaaatga 


qqaaattqca 


tcgcattgtc 


tqaqtaqqtq 


tcattctatt 


ctqqqqqqtq 


5820 


gggtggggca 

~i ~J ZJ zj zj zj ~j 


ggacagcaag 


gqqqaggatt 

ZJ ZJ ZJ ZJ ZJ ZJ 


gggaagacaa 


tagcaggcat 


qctggggatg 


5880 


cggtgggctc 


tatggcttct 


gaggcggaaa 


gaaccagctg 


gggctctagg 


gggtatcccc 


5940 


acgcgccctg 


tagcggcgca 


ttaagcgcgg 


cgggtgtggt 

ZJ ZJ ZJ ZJ ZJ ZJ 


ggttacgcgc 


agcgtgaccg 


6000 


ctacacttgc 


cagcgcccta 


gcgcccgctc 


ctttcgcttt 


cttcccttcc 


tttctcgcca 


6060 


l^ttcgccgg 


ctttccccgt 


caagctctaa 


atcggggcat 


ccctttaggg 


ttccgattta 


6120 


^■gctttacg 


gcacctcgac 


cccaaaaaac 


ttgattaggg 


tgatggttca 


cgtagtgggc 


6180 


catcgccctg 


atagacggtt 


tttcgccctt 


tgacgttgga 


gtccacgttc 


tttaatagtg 


6240 


gactcttgtt 


ccaaactgga 


acaacactca 


accctatctc 


ggtctattct 


tttgatttat 


6300 


aagggatttt 


ggggatttcg 


gcctattggt 


taaaaaatga 


gctgatttaa 


caaaaattta 


6360 


acgcgaatta 


attctgtgga 


atgtgtgtca 


gttagggtgt 


ggaaagtccc 

mJ ZJ ZJ 


caggctcccc 


6420 


aggcaggcag 


aagtatgcaa 


agcatgcatc 


tcaattagtc 


agcaaccagg 


tgtggaaagt 


6480 


ccccaggctc 


cccagcaggc 


agaagtatgc 


aaagcatgca 


tctcaattag 


tcagcaacca 


6540 


tagtcccgcc 


cctaactccg 


cccatcccgc 


ccctaactcc 


gcccagttcc 


gcccattctc 


6600 


cgccccatgg 


ctgactaatt 


ttttttattt 


atgcagaggc 


cgaggccgcc 

ZJ ZJ ZJ ZJ 


tctgcctctg 

ZJ ZJ 


6660 


agctattcca 


gaagtagtga 


ggaggctttt 


ttggaggcct 


aggcttttgc 


aaaaagctcc 


6720 


cgggagcttg 


tatatccatt 


ttcggatctg 


atcagcacgt 


gttgacaatt 


aatcategge 


6780 


•agtatatc 


ggcatagtat 


aatacgacaa 


ggtgaggaac 


taaaccatgg 


ccaagttgac 


6840 


W 

Tagtgccgtt 


ccggtgctca 


ccgcgcgcga 


cgtcgccgga 


gcggtcgagt 


tctggaccga 


6900 


ccggctcggg 

— ' — ' ~J *J ZJ 


ttctcccggg 


acttcgtgga 


ggacgacttc 


qccgqtqtqg 

ZJ ZJ ZJ *" ZJ ZJ ZJ 


teegggaega 


6960 


cgtgaccctg 


ttcatcagcg 


cggtccagga 


ccaggtggtg 


ccggacaaca 


ccctggcctg 


7020 


ggtgtgggtg 


cgcggcctgg 


acgagctgta 


cgccgagtgg 


teggaggteg 


tgtccacgaa 


7080 


cttecgggac 


gcctccgggc 

ZJ ZJ ZD ZJ 


cggccatgac 


cgagatcggc 


gagcagcegt 


qqqqqcgqqa 

ZJZJZJZJZJ^ZJZJZJ^ 


7140 




c nr r\"Zi f~ <~ (~ r~\t~\ 

cgcy cicc-cyy 


s~ s~ r\ r~\ f~ ~i -a t — H ri 

CCyyCddCig 


cgxgcacxxc 


gxggccgagg 


ageaggaexg 


/ £UU 


acacgtgcta 


cgagatttcg 


attccaccgc 


cgccttctat 


gaaaggttgg 


getteggaat 


7260 


cgttttccgg 


gacgccggct 


ggatgatcct 


ccagcgcggg 


gatctcatgc 


tggagttctt 


7320 


cgcccacccc 


aacttgttta 


ttgcagctta 


taatggttac 


aaataaagca 


atagcatcac 


7380 


aaatttcaca 


aataaagcat 


ttttttcact 


gcattctagt 


tgtggtttgt 


ccaaactcat 


7440 
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caatgtatct 


tatcatgtct 


gtataccgtc 


gacctctagc 


tagagcttgg 


cgtaatcatg 


7500 


qtcataqctq 


tttcctgtgt 


gaaattgtta 


tccgctcaca 


attccacaca 


acatacgagc 


7560 


cggaagcata 


aagtgtaaag 


cctggggtgc 


ctaatgagtg 


agctaactca 


cattaattgc 


7620 


gttgcgctca 


ctgcccgctt 


tccagtcggg 


aaacctgtcg 


tgccagctgc 


attaatgaat 


7680 


cggccaacgc 


qcqqqqaqaq 

ZJ ZJ ZJ ZJ 12 . ZJ ZJ 


qcqqtttqcq 

Zj Z3 Zy 23 3 


tattqqqcqc 


tcttccgctt 


cctcgctcac 


7740 


tgactcgctg 


cqctcqqtcq 

23 23 23 w ^* 23 


ttcqqctqcq 

*• *~ zj zj zj zj 


qcqaqcqqta 

Zj zj Zj zj ZJ 


tcagctcact 


caaaqqcqqt 

v* 23 23 23 23 


7800 


aatacggtta 


tccacagaat 


caqaqaataa 

zj i3 Zj zj w 


cgcaggaaag 


aacatgtgag 


caaaaggcca 


7860 


gcaaaaggcc 


aggaaccgta 


aaaaggccgc 


qttqctqqcq 

23 2? J? 2-? 23 


tttttccata 


ggctccgccc 


7920 


ccctgacgag 


catcacaaaa 


atcgacgctc 


aaqtcaqaqq 


tqqcqaaacc 


cgacaggact 


7980 


ataaagatac 


caggcgtttc 


cccctggaag 


ctccctcgtg 


cgctctcctg 


ttccgaccct 


8040 


gccgcttacc 


ggatacctgt 


ccgcctttct 


cccttcggga 


agcgtggcgc 


tttctcaatg 


8100 


ittcacqctqt 

1 


aggtatctca 


qttcqqtqta 


qqtcqttcqc 


tccaaqctqq 


qctqtqtqca 

23 23 23 23 


8160 


■aacccccc 


qttcaqcccq 


accgctgcgc 


cttatccggt 


aactatcgtc 


ttgagtccaa 


8220 


cccggtaaga 


cacgacttat 


cgccactggc 


agcagccact 


qqtaacaqqa 

23 23 2? 23 


ttagcagagc 


8280 


qaqqtatqta 


ggcggtgcta 


cagagttctt 


qaaqtqqtqq 


cctaactacg 


gctacactag 


8340 


aaggacagta 


tttggtatct 


gcgctctgct 


gaagccagtt 


accttcggaa 


aaagagttgg 


8400 


tagctcttga 


tccggcaaac 


aaaccaccgc 


tqqtaqcqqt 

23 Z? 23 37 23 


ggtttttttg 


tttgcaagca 


8460 


gcagattacg 


cgcagaaaaa 


aaggatctca 


agaagatcct 


ttgatctttt 


ctacggggtc 


8520 


tgacgctcag 


tggaacgaaa 


actcacgtta 


agggattttg 


gtcatgagat 


tatcaaaaag 


8580 


gatcttcacc 


tagatccttt 


taaattaaaa 


atgaagtttt 


aaatcaatct 


aaagtatata 


8640 


tgagtaaact 


tggtctgaca 


gttaccaatg 


cttaatcagt 


gaggcaccta 


tctcagcgat 


8700 


ctgtctattt 


cgttcatcca 


tagttgcctg 


actccccgtc 


gtgtagataa 


ctacgatacg 


8760 


gqagggctta 

^ZJ ZJ J J 3 ~* 


ccatctggcc 


ccagtgctgc 


aatgataccg 


cgagacccac 


gctcaccggc 


8820 


(■cagattta 


tcagcaataa 


accagccagc 


cqqaaqqqcc 

23 23 23 23 


qaqcqcaqaa 

23 23 23 23 


qtqqtcctqc 

23 23 23 23 


8880 


ciactttatcc 


gcctccatcc 


agtctattaa 


ttqttqccqq 


gaagctagag 


taagtagttc 


8940 


gccagttaat 


agtttgcgca 


acgttgttgc 


cattgctaca 


ggcatcgtgg 


tgtcacgctc 


9000 


gtcgtttggt 


atggcttcat 


tcagctccgg 


ttcccaacga 


tcaaggcgag 


ttacatgatc 


9060 


ccccatgttg 


tgcaaaaaag 


cggttagctc 


cttcggtcct 


ccgatcgttg 


tcagaagtaa 


9120 


gttggccgca 


gtgttatcac 


tcatggttat 


qqcaqcactq 


cataattctc 


ttactgtcat 


9180 


gcca xccgra 


agatgct'ttt: 


ctgtgactgg 


tgagtactca 


accaagtcat 


tctgagaata 




gtgtatgcgg 


cgaccgagtt 


gctcttgccc 


ggcgtcaata 


cgggataata 


ccgcgccaca 


9300 


tagcagaact 


ttaaaagtgc 


tcatcattgg 


aaaacgttct 


tcggggcgaa 


aactctcaag 


9360 


gatcttaccg 


ctgttgagat 


ccagttcgat 


gtaacccact 


cgtgcaccca 


actgatcttc 


9420 


agcatctttt 


actttcacca 


gcgtttctgg 


gtgagcaaaa 


acaggaaggc 


aaaatgccgc 


9480 
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aaaaaaggga ataagggcga cacggaaatg ttgaatactc atactcttcc tttttcaata 9540 

ttattgaagc atttatcagg gttattgtct catgagcgga tacatatttg aatgtattta 9600 

gaaaaataaa caaatagggg ttccgcgcac atttccccga aaagtgccac ctgacgtc 9658 
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